186 Human Secreted Proteins
Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and
their production.

Background of the Invention 

Unlike bacteria, which exist as a single compartment surrounded by a
membrane, human cells and other eucaryotes are subdivided by membranes into many
functionally distinct compartments. Each membrane-bounded compartment, or
organelle, contains different proteins essential for the function of the organelle. The cell
uses "sorting signals," which are amino acid motifs located within the protein, to target
proteins to particular cellular organelles.

One type of sorting signal, called a signal sequence, a signal peptide, or a leader
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum
(ER). The ER separates the membrane-bounded proteins from all other types of
proteins. Once localized to the ER, both groups of proteins can be further directed to
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other
organelles.

Proteins targeted to the ER by a signal sequence can be released into the
extracellular space as a secreted protein. For example, vesicles containing secreted
proteins can fuse with the cell membrane and release their contents into the extracellular
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or
secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell
membrane can also be secreted into the extracellular space by proteolytic cleavage of a
"linker" holding the protein to the membrane.

Despite the great progress made in recent years, only a small number of genes
encoding human secreted proteins have been identified. These secreted proteins include
the commercially valuable human insulin, interferon, Factor VIII, human growth
hormone, tissue plasminogen activator, and erythropoietin. Thus, in light of the
pervasive role of secreted proteins in human physiology, a need exists for identifying
and characterizing novel human secreted proteins and the genes that encode them. This
knowledge will allow one to detect, to treat, and to prevent medical disorders by using
secreted proteins or the genes that encode them.



WO 98/39448    PCT/US98/04493



Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies,
and recombinant methods for producing the polypeptides and polynucleotides. Also
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 

Detailed Description

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original
environment (e.g., the natural environment if it is naturally occurring), and thus is
altered "by the hand of man" from its natural state. For example, an isolated
polynucleotide could be part of a vector or a composition of matter, or could be
contained within a cell, and still be "isolated" because that vector, composition of
matter, or particular cell is not the original environment of the polynucleotide.

In the present invention, a "secreted" protein refers to those proteins capable of
being directed to the ER, secretory vesicles, or the extracellular space as a result of a
signal sequence, as well as those proteins released into the extracellular space without
necessarily containing a signal sequence. If the secreted protein is released into the
extracellular space, the secreted protein can undergo extracellular processing to produce
a "mature" protein. Release into the extracellular space can occur by many
mechanisms, including exocytosis and proteolytic cleavage.

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence
of the full length cDNA sequence, including the 5' and 3' untranslated sequences, the
coding region, with or without the signal sequence, the secreted protein coding region,
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence.
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated
amino acid sequence generated from the polynucleotide as broadly defined.

In the present invention, the full length sequence identified as SEQ ID NO:X
was often generated by overlapping sequences contained in multiple clones (contig
analysis). A representative clone containing all or most of the sequence for SEQ ID
NO:X was deposited with the American Type Culture Collection ("ATCC"). As
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the
ATCC Deposit Number. The ATCC is located at 12301 Park Lawn Drive, Rockville,
Maryland 20852, USA. The ATCC deposit was made pursuant to the terms of the
Budapest Treaty on the international recognition of the deposit of microorganisms for
purposes of patent procedure.
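
As an illustration only, the idea of building a longer sequence from overlapping clone sequences (contig analysis) can be sketched as below. The function name, the minimum overlap of 3 bases, and the two toy sequences are hypothetical and are not taken from this specification; real contig assembly must also tolerate sequencing errors and handle both strands.

```python
from typing import Optional


def merge_overlap(left: str, right: str, min_overlap: int = 3) -> Optional[str]:
    """Join two sequences when a suffix of `left` exactly matches a prefix of `right`.

    Returns the merged sequence, or None when no exact overlap of at least
    `min_overlap` bases exists. Exact matching is a simplifying assumption.
    """
    longest_possible = min(len(left), len(right))
    # Try the longest candidate overlap first so the merge is as compact as possible.
    for size in range(longest_possible, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:]
    return None


# Hypothetical toy clone sequences that share the five bases "TAGGC".
clone_a = "ATGGCCTTAGGC"
clone_b = "TAGGCAATTCGA"
print(merge_overlap(clone_a, clone_b))
```

Trying the longest overlap first is a design choice of this sketch; it avoids producing a contig that duplicates bases already shared by both clones.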

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained
in SEQ ID NO:X, the complement thereof, or the cDNA contained within the clone
deposited with the ATCC. "Stringent hybridization conditions" refers to an overnight
incubation at 42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl,
75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution,
10% dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed
by washing the filters in 0.1x SSC at about 65°C.
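
The SSC strengths quoted above scale linearly. Since 5x SSC is given as 750 mM NaCl and 75 mM sodium citrate, 1x SSC corresponds to 150 mM NaCl and 15 mM sodium citrate. The following minimal sketch (the function and variable names are illustrative assumptions) computes the salt concentrations at any strength, including the 0.1x wash.

```python
# 1x SSC derived from the stated 5x values (750 mM NaCl, 75 mM sodium citrate).
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_components(strength: float) -> dict:
    """Return the millimolar concentration of each SSC component at `strength`x."""
    # Rounding hides binary floating-point noise (e.g. 150 * 0.1).
    return {name: round(conc * strength, 6) for name, conc in SSC_1X_MM.items()}


print(ssc_components(5))    # hybridization solution
print(ssc_components(0.1))  # stringent wash
```

The roughly fifty-fold drop in salt between the hybridization solution and the wash is what makes the wash step the more stringent of the two.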

Also contemplated are nucleic acid molecules that hybridize to the
polynucleotides of the present invention at lower stringency hybridization conditions.
Changes in the stringency of hybridization and signal detection are primarily
accomplished through the manipulation of formamide concentration (lower percentages
of formamide result in lowered stringency), salt conditions, or temperature. For
example, lower stringency conditions include an overnight incubation at 37°C in a
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA,
pH 7.4), 0.5% SDS, 30% formamide, 100 µg/ml salmon sperm blocking DNA;
followed by washes at 50°C with 1X SSPE, 0.1% SDS. In addition, to achieve even
lower stringency, washes performed following stringent hybridization can be done at
higher salt concentrations (e.g. 5X SSC).
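
For the SSPE-based conditions, the working concentrations follow from the stated 20X stock (3 M NaCl, 0.2 M NaH2PO4, 0.02 M EDTA). The sketch below is an illustration under that assumption; the function names and the 100 ml batch size are hypothetical.

```python
# Molar composition of the 20X SSPE stock as stated in the text.
SSPE_20X_M = {"NaCl": 3.0, "NaH2PO4": 0.2, "EDTA": 0.02}
STOCK_STRENGTH = 20.0


def sspe_components(strength: float) -> dict:
    """Molar concentration of each SSPE component at `strength`X."""
    factor = strength / STOCK_STRENGTH
    return {name: round(conc * factor, 6) for name, conc in SSPE_20X_M.items()}


def stock_volume_ml(final_volume_ml: float, strength: float) -> float:
    """Volume of 20X stock needed to prepare `final_volume_ml` at `strength`X."""
    return final_volume_ml * strength / STOCK_STRENGTH


print(sspe_components(6))       # hybridization solution (6X SSPE)
print(sspe_components(1))       # wash (1X SSPE)
print(stock_volume_ml(100, 6))  # ml of 20X stock for a hypothetical 100 ml batch
```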

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and
commercially available proprietary formulations. The inclusion of specific blocking
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any
double-stranded cDNA clone).
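
The exclusion above can be pictured as a simple screen on candidate probe sequences. The sketch below is an assumption-laden illustration, not a method from this specification: it treats a probe as excluded when it consists solely of A residues or solely of T/U residues, and the function name is hypothetical.

```python
def is_poly_a_or_complement_only(seq: str) -> bool:
    """True when `seq` is nothing but a run of A, or nothing but a run of T/U.

    Such a probe would pair with any poly(A) tract or its complement and so
    carries no sequence-specific information.
    """
    bases = set(seq.upper())
    if not bases:
        return False  # an empty string is not a probe at all
    return bases <= {"A"} or bases <= {"T", "U"}


for probe in ("TTTTTTTTTTTT", "AAAAAAAA", "ATGGCCTTAGGC"):
    print(probe, is_poly_a_or_complement_only(probe))
```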
The polynucleotide of the present invention can be composed of any
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA
or modified RNA or DNA. For example, polynucleotides can be composed of single-
and double-stranded DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a mixture of single- and
double-stranded regions, hybrid molecules comprising DNA and RNA that may be
single-stranded or, more typically, double-stranded or a mixture of single- and double-
stranded regions. In addition, the polynucleotide can be composed of triple-stranded
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also
contain one or more modified bases or DNA or RNA backbones modified for stability
or for other reasons. "Modified" bases include, for example, tritylated bases and
unusual bases such as inosine. A variety of modifications can be made to DNA and
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically
modified forms.

The polypeptide of the present invention can be composed of amino acids joined
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and
may contain amino acids other than the 20 gene-encoded amino acids. The
polypeptides may be modified by either natural processes, such as posttranslational
processing, or by chemical modification techniques which are well known in the art.
Such modifications are well described in basic texts and in more detailed monographs,
as well as in a voluminous research literature. Modifications can occur anywhere in a
polypeptide, including the peptide backbone, the amino acid side-chains and the amino
or carboxyl termini. It will be appreciated that the same type of modification may be
present in the same or varying degrees at several sites in a given polypeptide. Also, a
given polypeptide may contain many types of modifications. Polypeptides may be
branched, for example, as a result of ubiquitination, and they may be cyclic, with or
without branching. Cyclic, branched, and branched cyclic polypeptides may result
from posttranslation natural processes or may be made by synthetic methods.
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative,
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond
formation, demethylation, formation of covalent cross-links, formation of cysteine,
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation,
pegylation, proteolytic processing, phosphorylation, prenylation, racemization,
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins
such as arginylation, and ubiquitination. (See, for instance, PROTEINS -
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W.
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990);
Rattan et al., Ann NY Acad Sci 663:48-62 (1992).)

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it 
need not be identical to that of the polypeptide, but rather substantially similar to the 
dose-dependence in a given activity as compared to the polypeptide of the present 
invention (i.e., the candidate polypeptide will exhibit greater activity or not more than 
about 25-fold less and, preferably, not more than about tenfold less activity, and most 
preferably, not more than about three-fold less activity relative to the polypeptide of the 
present invention.) 
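
The activity comparison in this definition can be expressed as a fold-difference check. The sketch below is illustrative only: the specification says "about" 25-fold, tenfold, and three-fold, whereas the code treats these as hard cut-offs for simplicity, and the function name, tier labels, and activity values are hypothetical.

```python
def activity_tier(candidate_activity: float, reference_activity: float) -> str:
    """Classify a candidate by how many fold less active it is than the reference.

    Both activities must be positive numbers measured in the same assay.
    A candidate with greater activity than the reference falls in the top tier.
    """
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    fold_less = reference_activity / candidate_activity
    if fold_less <= 3:
        return "most preferred"      # not more than about three-fold less
    if fold_less <= 10:
        return "preferred"           # not more than about tenfold less
    if fold_less <= 25:
        return "within definition"   # not more than about 25-fold less
    return "outside definition"


# Hypothetical assay readouts against a reference activity of 100 units.
for value in (200, 50, 10, 5, 1):
    print(value, activity_tier(value, 100))
```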

Polynucleotides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in testes tumor and to a lesser extent in fetal
brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, particularly of the testes, and defects of the central nervous system
such as seizure and neurodegenerative disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly cancer of the testes and central nervous system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., testes and other reproductive tissue, brain and
other tissue of the nervous system, and blood cells, and spleen, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of testicular cancer and
treatment of central nervous system disorders since this gene is primarily expressed in
the testes tumor and developing brain.

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in cancer tissues, such as breast cancer and
Wilms' tumor, and to a lesser extent in fetal tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancers and/or tumors, particularly those found in the breast, and developmental
abnormalities or disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the glandular tissues, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., mammary
tissue, and fetal tissue, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO. 314 as residues: Pro-11 to Thr-18, Leu-43 to Pro-50,
Gly-64 to Leu-72, and Leu-81 to Lys-86.
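
Epitope listings of this form (three-letter residue, position, "to", three-letter residue, position) recur throughout the specification and can be turned into structured ranges. The sketch below is one possible way to do so; the regular expression and function name are assumptions of this example, and the input string is the SEQ ID NO. 314 listing with OCR spacing normalized.

```python
import re

# Matches ranges such as "Pro-11 to Thr-18" (three-letter code, hyphen, position).
EPITOPE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitopes(text: str) -> list:
    """Return (start_residue, start_pos, end_residue, end_pos) tuples found in `text`."""
    return [
        (start_res, int(start), end_res, int(end))
        for start_res, start, end_res, end in EPITOPE_RE.findall(text)
    ]


listing = "Pro-11 to Thr-18, Leu-43 to Pro-50, Gly-64 to Leu-72, and Leu-81 to Lys-86"
for start_res, start, end_res, end in parse_epitopes(listing):
    # Length counts both end residues.
    print(f"{start_res}{start}-{end_res}{end}: {end - start + 1} residues")
```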

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of cancers and/or tumors,
particularly those found in the breast, since expression is mainly in cancer/tumor
tissues. The polypeptides may also serve as therapeutic proteins for
proliferation/differentiation of fetal tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 3

This gene is expressed primarily in CD34 depleted buffy coat and to a lesser
extent in spleen, chronic lymphocytic leukemia.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: blood disorders or
leukemias, diseases of the immune system. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, and spleen, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of blood disorders or
leukemias, diseases of the immune system since expression is in tissues related to
immune function.

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

This gene is expressed primarily in CD34 depleted buffy coat. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: blood disorders or
lymphocytic diseases. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of blood disorders since
expression is in tissues related to immune function.

FEATURES OF PROTEIN ENCODED BY GENE NO: 5

This gene is expressed primarily in CD34 depleted buffy coat.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: blood or immune
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., blood cells, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO. 317 as residues:
Pro-13 to Lys-21.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of blood disorders since
expression is in tissues related to immune function.

FEATURES OF PROTEIN ENCODED BY GENE NO: 6

This gene is expressed primarily in CD34 depleted buffy coat.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: blood or immune
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., blood cells, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO. 318 as residues:
Lys-31 to Lys-39.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of blood diseases since it
is expressed in tissues related to immune function.

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

This gene is expressed primarily in CD34 depleted buffy coat and to a lesser
extent in pineal gland.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases of the immune
system and brain associated diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, and pineal gland, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of blood disorders,
immune diseases or brain associated diseases (specifically of the pineal gland) since
expression is in tissues related to immune function.

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

The translation product of this gene shares sequence homology with an organic
cation transporter which is thought to be important in organic cation uptake in the
kidney and liver. (See Accession No. 2343059.) Preferred polypeptide fragments
comprise the amino acid sequence riTAIQMlCLVhDCELYinT^vnRhDCGV^ 
FIVFRU^EVWQAIJ^UIJ^AVI^IXAAGVT^^ 

VQTSEPSGT (SEQ ID NO: 615) or TMKDAENLGRKAKPKENT (SEQ ID NO: 616) as well
as N-terminal and C-terminal deletions of these fragments. Also preferred are 
polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hepatic and renal
diseases where drug elimination/cation exchange (organic cation uptake) in the liver and
kidney are problematic. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hepatic or renal system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues and cell types (e.g., kidney
and liver, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO. 320 as residues: Asn-64 to Asn-74, and Gln-81 to Gly-87.

The tissue distribution and homology to organic cation transporter indicate that
polynucleotides and polypeptides corresponding to this gene are useful as a polyspecific
transporter that is important for drug elimination in the liver (and possibly kidney) since
expression is found in the liver.

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

This gene is expressed primarily in eosinophil induced with IL-5 and to a lesser
extent in fetal liver and spleen. This gene also maps to chromosome 15, and therefore
can be used in linkage analysis as a marker for chromosome 15.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases of the immune
system, particularly allergies or asthma. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, liver, and spleen, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of diseases involving
eosinophil reactions since expression seems to be concentrated in eosinophils and other
tissues involved in immunity such as the liver and spleen.

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in tissues of hematopoietic lineage and to a
lesser extent in Hodgkin's lymphoma. Any frame shifts in this sequence can easily be
clarified using known molecular biology techniques.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune deficiency or dysfunction. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues and
cell types (e.g., hematopoietic cells, lymphoid and reticuloendothelial tissues, and
cancerous tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of lymphomas or
immune dysfunction or as a therapeutic protein useful in immune modulation based on
expression in anergic T-cells and lymphomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 11

This gene is expressed primarily in neutrophils and to a lesser extent in activated
lymphoid cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the cell type present in a biological sample and
for diagnosis of diseases and conditions: inflammation. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues and
cell types (e.g., blood cells and lymphoid tissue, and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO. 323 as residues: Glu-40 to Lys-46.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for modulation of an immune reaction or as a
growth factor for the differentiation or proliferation of neutrophils for the treatment of
neutropenia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

15 This gene is expressed primarily in brain and to a lesser extent in activated T- 

cells. It is likely that the open reading frame containing the predicted signal peptide 
continues in the 5' direction. Preferred polypeptide fragments comprise the amino acid 
sequence PRVRNSPEDLGLSLTGDSCKL (SEQ ID NO:6l7). 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurodegenerative
disorders including ischemic shock, Alzheimer's and cognitive disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the central nervous
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., blood cells, brain and other tissue of
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO. 324 as residues: Ser-5 to Glu-14, Ile-21 to Pro-35, Ser-65 to Asp-81, Cys-89 to
Val-96, Lys-136 to Ser-145, Ile-152 to Met-169, and Arg-189 to Lys-196.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis/treatment of cancers of the given
tissue or in the treatment of neurological disorders of the CNS.

FEATURES OF PROTEIN ENCODED BY GENE NO: 13

This gene was also recently cloned by other groups, who named this calcium-activated
potassium channel gene hKCa4. (See Accession No. AF033021; see also Accession
No. 2584866.) This gene is mapped to human chromosome 19q13.2. A second signal
sequence likely exists upstream from the predicted signal sequence as described in
Table 1. Preferred polypeptide fragments comprise: QADDLQATVAALCVLRGGGPWAG
SWLSPKTPGAMGGDLVLGLX5ALRRRKRLL (SEQ ID NO: 618); or EQEKSLAGWALVLAXXGIGL
KmJiAEMLWFGGCSAVNATGHl^DTLWUPrm.TlG 
TAIXVAWARKLEF>nCAEiamiNFMMDIQYTKEM

XHQRXUJSAINAFRQVRLKHRKUtEQVNSMVDISK^^ 

KLDALTELLSTALGPRQLPEPSQQSK (SEQ ID NO: 619), as well as N-terminal and
C-terminal deletions. Also preferred are polynucleotide fragments encoding these
polypeptide fragments.
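
As an illustration only, the N-terminal and C-terminal deletions of a preferred fragment can be enumerated programmatically. The sketch below is not part of the disclosed method: the function name `terminal_deletions`, the minimum retained length `min_length`, and the toy peptide `ACDEFGHIK` are all hypothetical and were chosen for the example.

```python
def terminal_deletions(sequence, min_length=6):
    """List fragments obtained by removing residues from either terminus.

    Returns (n_deleted, c_deleted, fragment) tuples. The full-length
    sequence is excluded, and fragments shorter than min_length
    (an assumed cutoff, not taken from the text) are skipped.
    """
    fragments = []
    total = len(sequence)
    for n_del in range(total):                  # residues removed from the N-terminus
        for c_del in range(total - n_del):      # residues removed from the C-terminus
            fragment = sequence[n_del:total - c_del]
            if len(fragment) >= min_length and (n_del or c_del):
                fragments.append((n_del, c_del, fragment))
    return fragments


# Toy peptide, hypothetical; not one of the sequences of the invention.
deletions = terminal_deletions("ACDEFGHIK", min_length=7)
```

With the toy peptide and a minimum length of 7, the list holds the five fragments that lose one or two residues in total from the two termini.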

This gene is expressed primarily in breast lymph node and T-cells, and to a
lesser extent in placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hematologic and
immune disorders. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., lymphoid
tissue, blood cells and placenta, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO. 325 as residues: Arg-13 to Lys-23.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment/diagnosis of hematologic diseases
and diseases involving immune modulation, based on distribution in the lymph node and
T-cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 14

This gene was recently cloned by another group, calling it PAPS synthase.
(See Accession No. e1204135.) Preferred polypeptide fragments comprise the amino
acid sequence YQAHHVSRNKRGQWGTRGGFRGCTVWLTGLSGAGK (SEQ ID NO: 620).
Also preferred are the polynucleotide fragments encoding this polypeptide fragment.

It has been discovered that this gene is expressed primarily in benign prostate
hyperplasia and human umbilical vein endothelial cells, and to a lesser extent in smooth
muscle and human endometrial stromal cells treated with estradiol.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: inflammation, ischemia,
and restenosis, based on endothelial cell and smooth muscle cell expression, and
prostate diseases such as benign prostate hyperplasia or prostate cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the prostate or vessels
of the circulatory system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., prostate, endothelial
cells, smooth muscle, and endometrium, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO. 326 as residues: Arg-21 to Asp-26, Lys-35 to Lys-44,
and Glu-49 to Asn-58.
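
For readers who wish to work with the preferred epitope ranges computationally, the residue ranges listed above can be parsed into numeric positions. The following is a minimal sketch under stated assumptions: the function names and the regular expression are hypothetical helpers introduced here, and the only data used is the range list given above for SEQ ID NO. 326.

```python
import re

# Matches ranges written as "Arg-21 to Asp-26": a three-letter residue
# code, a hyphen, and a position, on each side of the word "to".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def epitope_lengths(ranges):
    """Inclusive length, in residues, of each parsed range."""
    return [end - start + 1 for _, start, _, end in ranges]


ranges = parse_epitope_ranges(
    "Arg-21 to Asp-26, Lys-35 to Lys-44, and Glu-49 to Asn-58"
)
lengths = epitope_lengths(ranges)
```

Positions are kept 1-based, as in the text, so a range such as Arg-21 to Asp-26 spans six residues.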

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating/diagnosing diseases or conditions
where the endothelial cell lining of the veins and arteries or underlying smooth muscle
are involved.

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

This gene is expressed primarily in human 6 week embryo and to a lesser extent 
in placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: developmental
anomalies or fetal deficiencies. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly developmental in nature, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic
tissue, and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO. 327 as residues: Lys-50 to Glu-57.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for detection of developmental abnormalities.

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

This gene is expressed primarily in kidney and amygdala and to a lesser extent
in fetal tissues. This gene is mapped to chromosome 14, and therefore is useful in
linkage analysis as a marker for chromosome 14.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) present in a biological sample and
for diagnosis of diseases and conditions: kidney diseases, neurological disorders and
developmental abnormalities. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s). For a number of disorders of the above tissues, particularly of the renal
system or developing fetal tissues, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., kidney,
amygdala, and fetal tissues, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.
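
The comparison of a sample's expression level against the standard gene expression level can be sketched as a simple fold-change test. This is an illustration under stated assumptions, not the detection method of the invention: the text does not define what counts as "significantly higher or lower", so the two-fold cutoff `fold_threshold`, the function name `classify_expression`, and the numeric levels used below are all hypothetical.

```python
import math


def classify_expression(sample_level, standard_level, fold_threshold=2.0):
    """Classify expression in a sample relative to the standard level.

    standard_level stands for the expression level in healthy tissue or
    bodily fluid. fold_threshold is an assumed cutoff, not a value taken
    from the text.
    """
    if sample_level <= 0 or standard_level <= 0:
        raise ValueError("expression levels must be positive")
    log2_ratio = math.log2(sample_level / standard_level)
    cutoff = math.log2(fold_threshold)
    if log2_ratio >= cutoff:
        return "higher"
    if log2_ratio <= -cutoff:
        return "lower"
    return "unchanged"


# Hypothetical measurements in arbitrary units.
elevated = classify_expression(8.0, 2.0)
reduced = classify_expression(1.0, 4.0)
similar = classify_expression(3.0, 2.0)
```

Working on the log2 scale treats a two-fold increase and a two-fold decrease symmetrically, which is why the same cutoff is applied in both directions.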

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment or diagnosis of conditions affecting
the brain, kidneys and fetal development.

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

This gene is expressed primarily in ovarian cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: solid tumors similar to
ovarian cancer. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the reproductive system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., ovarian and other
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO. 329 as residues: Ser-51 to Val-56.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment of solid tumors of the
reproductive system such as ovarian cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

This gene is expressed primarily in brain medulloblastoma. Preferred polypeptide
fragments comprise the amino acid sequence: IRHEQHPNFSUa^HSKGSSLLLFLPQL
ILBLPVCAHLHEELNC (SEQ ID NO: 643) and SFHSEEKGHLLLQAERHPWVAGALVGVSG
GLTLTTCSGPTEKPATKNYFLKRLLQEMHIRAN (SEQ ID NO: 644), as well as N-terminal
and C-terminal deletions. Also preferred are polynucleotide fragments encoding these
polypeptide fragments.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors, particularly of
the CNS. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the central nervous system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., brain and other
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating medulloblastoma or similar tumors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 19

This gene is expressed primarily in adipocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: obesity. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the adipose tissues,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., adipocytes and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating obesity by regulating the function and
number of adipocytes.

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

This gene is expressed primarily in B cell lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, disorders of the immune system, with an emphasis on B cell lymphoma.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of tumors
of the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., blood cells, and lymphoid
tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of B cell derived
tumors based on its expression in B cell lymphomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

This gene is expressed primarily in immune cells and to a lesser extent in fetal
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: inflammatory diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., cells of the immune system, and
fetal tissues, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO:333 as residues: Asp-10 to Pro-19, Ser-74 to Tyr-79, and Glu-95 to Lys-110.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment of diseases involving alterations in T
cell activity.

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

It has been discovered that this gene is expressed primarily in ovarian tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors, particularly of
the ovary. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
tumors of the reproductive organs, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., ovarian
and other reproductive tissue and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO. 334 as residues: Leu-22 to Gln-27.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of ovarian tumors as it
has only been identified in ovarian tumors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

It has been discovered that this gene is expressed primarily in fetal tissues and to
a lesser extent in osteoclastoma cell line.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: osteoporosis or arthritis.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
skeletal system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., bone cells, and fetal tissue, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of conditions of abnormal bone 
remodeling due to enhanced activity of osteoclasts. This may be useful as a specific 
marker for malignancies derived from osteoclasts or their precursors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

The translation product of this gene shares sequence homology with a
periplasmic ribonuclease which is thought to be important in degrading extracellular
polynucleotides.

It has been discovered that this gene is expressed primarily in serum treated
smooth muscle cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: vascular disease such as
restenosis. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the vasculature, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., smooth muscle, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 336 as residues:
Gln-30 to Lys-36, and Pro-41 to Arg-48.

The tissue distribution and homology to ribonucleases indicate that
polynucleotides and polypeptides corresponding to this gene are useful for treatment of
pathological conditions of smooth muscle associated with bacterial or viral infiltration.

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

This gene is expressed primarily in Early Stage Human Brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: disorders of human brain
development and related diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly disorders of human brain development and related diseases,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to this gene indicate that polynucleotides
and polypeptides corresponding to this gene are useful for diagnosis and treatment of
diseases affecting human brain development and related diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 26

It has been discovered that this gene is expressed primarily in human brain
tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: human brain diseases
and other related diseases, which may be caused by brain diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly
human brain diseases, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to the gene indicate that polynucleotides
and polypeptides corresponding to this gene are useful for diagnosis and treatment of
human brain diseases and other related diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

It has been discovered that this gene is expressed primarily in Anergic T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune diseases,
inflammatory diseases and diseases related to T lymph cells. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly immune diseases,
inflammatory diseases and diseases related to T lymph cells, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to the gene indicate that polynucleotides
and polypeptides corresponding to this gene are useful for immune diseases,
inflammatory diseases and diseases related to T lymph cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

The translation product of this gene shares sequence homology with Shigella
flexneri positive transcriptional regulator CriR (criR) gene which is thought to be
important in regulation of gene expression.

This gene is expressed primarily in human synovial sarcoma and normal human
brain tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: human brain diseases,
particularly sarcomas of the synovium. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the human brain and synovium and other related human
brain diseases, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., synovial tissue, and brain and
other tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of human synovial
sarcoma and other related human brain diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

This gene is expressed in bone marrow, infant brain, fetal liver and spleen, 
prostate and to a lesser extent in pineal gland, adipose tissue, kidney, adrenal gland, 
umbilical vein endothelial cells, and T cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for identification of the tissue(s) or cell type(s) present in a biological sample
and for diagnosis of diseases and conditions: diseases related to bone marrow or
hematoplastic tissues, prostate, kidney, adrenal gland, and cardiovascular tissue or
organs. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly diseases related to
hematoplastic tissues, immune system, prostate, kidney, adrenal gland, and
cardiovascular tissue or organs, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., bone marrow,
hematopoietic cells, pineal gland, adipose tissue, kidney, adrenal gland, endothelial
cells, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to the gene indicate that polynucleotides
and polypeptides corresponding to this gene are useful for diagnosis and treatment of
diseases related to hematoplastic tissues, immune system, prostate, kidney, adrenal
gland, and cardiovascular tissue or organs.

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

This gene is expressed primarily in meningea and to a lesser extent in breast and
adult brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases of the
meningea and related brain diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the meningea and related brain diseases, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., meningea, mammary tissue, and brain and other tissue of
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of diseases of the
meningea and related brain diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 31

This gene is expressed in meningea, fetal spleen, osteoblast and to a lesser 
extent in activated T-cells, endometrial stromal cells, fetal lung, HL-60, thymus, testis 
and endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: meningeal disease,
osteoporosis, immune diseases, and hematoplastic diseases. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for identification of the tissue(s) or cell type(s). For a number of disorders of the
above tissues or cells, particularly of the meningeal diseases, osteoporosis, immune
diseases, and hematoplastic diseases, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, endometrium, lung, thymus, testis, and endothelial cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to gene indicate that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of
meningeal diseases, osteoporosis, immune diseases, hematoplastic diseases, testis diseases and
lung diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

This gene is expressed primarily in human thymus and to a much lesser extent
in infant brain, T-cells, smooth muscle, endothelial cells, bone marrow, human ovarian
tumor and keratinocytes, testes, osteoclastoma, breast, and tonsils.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Diseases involving the
thymus, particularly thymic cancer and diseases involving T-cell maturation. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the thymus, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., thymus, brain, and other tissue of the nervous system, 
blood cells, bone marrow, ovaries, and testes, and other reproductive tissue, mammary 
tissue, tonsils, melanocytes and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to gene indicate that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosis and treatment of
diseases of the thymus, particularly thymic cancer and diseases involving T-cell
maturation. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 33

This gene is expressed primarily in human tonsils, and placenta, and to a lesser 
extent in adipocytes, melanocyte, and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: inflammatory diseases,
immune diseases, and obesity. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inflammatory diseases, immune diseases, and obesity, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., tonsils, placenta, adipocytes, melanocytes, and brain and 
other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. 

The tissue distribution and homology to this gene indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases such as inflammation, immune diseases, and obesity.

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

This gene is expressed in activated T cells, and to a lesser extent in pituitary,
testis, and breast lymph node. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases relating to T
cells. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
disorders of the immune system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., pituitary, testes
and other reproductive tissue, mammary tissue, and lymphoid tissue, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 35

This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
diseases relating to neurological disorders, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
brain, and other tissue of the nervous system, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of neurological
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 36

This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurological disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
diseases relating to neurological disorders, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

This gene is expressed primarily in human ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: ovarian cancer.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
ovarian disorders such as those involving germ cells, ovarian follicles, stromal cells,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., ovary and other reproductive tissue, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of ovariopathy.

FEATURES OF PROTEIN ENCODED BY GENE NO: 38

This gene is expressed primarily in lymph node breast cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: breast cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the breast cancer,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., mammary tissue and lymphoid tissue, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a diagnostic marker for breast cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

This gene is expressed primarily in brain and to a lesser extent in other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neuronal disorders such
as trauma, brain degeneration, and brain tumor. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the brain, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and therapeutic treatment of
neuronal disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 40

This gene is expressed in early stage human embryo, adrenal gland tumor, and 
immune tissues such as fetal liver, fetal spleen, T-cell, and myeloid progenitor cell line
and to a lesser extent in ovary, colon cancer, and a few other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumorigenesis including
adrenal gland tumor, colon cancer and various other tumors, developmental and
immune disorders. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the cancer tissues, early stage human tissues, and immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, spleen, blood cells, bone marrow, ovary 
and other reproductive tissue, and colon, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and therapeutic treatment of immune
and developmental disorders, and tumorigenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in fetal lung, endothelial cells, liver, thymus 
and a few other immune tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune disorders such
as immune deficiency and autoimmune diseases, pulmonary diseases, liver diseases,
and tumor metastasis. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the fetal lung, liver, endothelial cells, and immune tissues, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., lung, endothelial cells, liver, thymus, and other tissue of
the immune system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of immune disorders and pulmonary
and hepatic diseases. Its promoter may also be used for immune system and lung-specific
gene therapies. The expression of this gene in endothelial cells indicates that it
may also be involved in angiogenesis, and therefore may play a role in tumor metastasis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

This gene is expressed primarily in liver, thyroid, parathyroid and to a lesser
extent in fetal lung, stomach and early embryos.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: metabolic regulation,
obesity, hepatic failure, hepatocellular tumors or thyroiditis and thyroid tumors. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the digestive/endocrine
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., liver, thyroid, parathyroid, lung,
stomach, and embryonic tissue, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and the extracellular locations indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the detection
and treatment of digestive/endocrine disorders, including metabolic regulation, hepatic
failure, malabsorption, gastritis and neoplasms.

FEATURES OF PROTEIN ENCODED BY GENE NO: 43

This gene is expressed primarily in schizophrenic adult brain, pituitary, front
cortex, hypothalamus and to a lesser extent in retina, adipose and stomach cancer and
placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: schizophrenia and other
neurological disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the central nervous system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues and cell types (e.g., retinal
tissue, adipose, stomach, and placenta, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful in treatment/detection of disorders in the nervous
system, including schizophrenia, neurodegeneration, and neoplasia. Additionally, a
secreted protein in brain may serve as an endocrine factor.

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares sequence homology with GTP
binding proteins which are thought to be important in signal transduction and protein
transport. 

This gene is expressed primarily in umbilical vein and microvascular endothelial
cells, GM-CSF treated macrophage, anergic T cells, osteoblast, osteoclast, CD34+ cells 
and to a lesser extent in gall bladder. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: bone formation and
growth, osteonecrosis, osteoporosis, angiogenesis and/or hematopoiesis. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the skeletal and
hematopoietic systems, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., endothelial cells, blood
cells, bone, and gall bladder, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. 

The tissue distribution and homology to GTP binding proteins indicate that
polynucleotides and polypeptides corresponding to this gene are useful for
treatment/detection of bone formation and growth, osteonecrosis, osteoporosis, and/or
hematopoiesis because of its involvement in growth signaling or angiogenesis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with signal 
sequence receptor gamma subunit which is thought to be important in protein
translocation on endoplasmic reticulum.

This gene is expressed primarily in adrenal gland, salivary gland, prostate, and 
to a lesser extent in endothelial cells and smooth muscle. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: protein secretion.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
secretory organs, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., adrenal gland, salivary gland,
prostate, endothelial cells, and smooth muscle, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to SSR gamma subunit indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for endocrine 
disorders, prostate cancer, xerostomia or sialorrhea. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed primarily in osteoclastoma cells and to a lesser extent in 
melanocyte, amygdala, brain, and stomach.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: ossification,
osteoporosis, fracture, osteonecrosis, osteosarcoma. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the skeletal systems, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues and
cell types (e.g., melanocytes, amygdala, brain and other tissue of the nervous system,
and stomach, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful in intervention of ossification, osteoporosis,
fracture, osteonecrosis and osteosarcoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48

The translation product of this gene shares sequence homology with proline rich
proteins which are thought to be important in protein-protein interaction.

This gene is expressed primarily in brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurological and
psychological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system and endocrine system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to proline-rich proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful in intervention
and detection of neurological diseases, including trauma, neoplasia, degenerative or
metabolic conditions in the central nervous system. Additionally, the gene product may be
secreted by the brain as an endocrine factor.

FEATURES OF PROTEIN ENCODED BY GENE NO: 49

The translation product of this gene shares sequence homology with the AOCB 
gene from Aspergillus nidulans which is important in asexual development.

This gene is expressed primarily in infant brain and to a lesser extent in the
developing embryo, trachea tumors, B-cell lymphoma and synovial sarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurodegenerative
diseases, leukemia and sarcomas. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the brain and immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., embryonic tissue, blood cells, trachea, and synovial tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in infant brain and sarcomas and homology to a gene
involved in a key step of eukaryotic development (fungal spore formation) indicate
that the protein product of this clone could play a role in neurological diseases such as
schizophrenia, particularly in infants. The existence of the gene in a B-cell lymphoma
indicates the gene may be used in the treatment and detection of leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

This gene is expressed primarily in fetal lung.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary disorders
including lung cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the pulmonary system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., lung, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution of this gene only in fetal lung indicates that it plays a key
role in development of the pulmonary system. This would suggest that misregulation of
the expression of this protein product in the adult could lead to lymphoma or sarcoma
formation, particularly in the lung. It may also be involved in predisposition to certain
pulmonary defects such as pulmonary edema and embolism, bronchitis and cystic
fibrosis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in hematopoietic cell types and fetal cells and to 
a lesser extent in all tissue types.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in the immune 
system and hematopoiesis. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and hematopoietic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, and fetal tissue, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution of this gene predominantly in hematopoietic cells and in 
the developing embryo indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for detection and treatment of lymphomas and disease states 
affecting the immune system or hematopoiesis, such as leukemia, AIDS, 
arthritis and asthma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene is expressed primarily in prostate and to a lesser extent in fetal spleen, 
fetal liver, infant brain and T cell leukemias. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: prostate disorders, 
prostate cancer, leukemia. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and/or prostate gland, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., thymus, spleen, liver, brain and other tissue of the nervous system, and 
blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution of this gene in prostate indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for detection or treatment of prostate 
disorders or prostate cancer. Its distribution in fetal liver and fetal spleen indicates it 
may play a role in the immune system and its misregulation could lead to immune 
disorders such as leukemia, arthritis and asthma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

The translation product of this gene shares sequence homology with dynein. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuro-degenerative 
diseases of the brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly neuro-degenerative diseases, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The predominant tissue distribution in the brain and homology to dynein, a 
microtubule motor protein involved in the positioning of cellular organelles and 
molecules, indicate that polynucleotides and polypeptides corresponding to this gene 
are useful for detection/treatment of neurodegenerative diseases, such as Alzheimer's, 
Huntington's and Parkinson's diseases and schizophrenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

The translation product of this gene shares sequence homology with ubiquitin-conjugation 
protein, an enzyme which is thought to be important in the processing of 
the Huntington's disease-causing gene. 

This gene is expressed primarily in brain and to a lesser extent in activated 
macrophages. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurodegenerative 
disease states including Huntington's disease. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of brain tissues. For a number of disorders of the above 
tissues or cells, particularly of the neurological system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and blood cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The predominant tissue distribution of this gene in the brain and its homology to 
a Huntington interacting protein indicate that polynucleotides and polypeptides 
corresponding to this gene are useful for the regulation of the expression of the 
Huntington disease gene and other neurodegenerative diseases including 
spinocerebellar ataxia types I and III, dentatorubropallidoluysian and spinal bulbar 
muscular atrophy. In addition, the existence of elevated levels of free ubiquitin pools in 
Alzheimer's disease, Parkinson's disease and amyotrophic lateral sclerosis indicates 
that the ubiquitin pathway of protein degradation plays a role in these disease states. 
Thus, considering that the gene described here is homologous to a ubiquitin-conjugation 
protein, it may play a general role in neurodegenerative conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

This gene is expressed primarily in T-cells (anergic T-cells, resting T-cells, 
apoptotic T-cells) and lymph node (breast), as well as brain (hypothalamus, 
hippocampus, pituitary, infant brain, early-stage brain). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune (e.g. 
immunodeficiencies, autoimmunities, inflammation, leukemias & lymphomas) and 
neurological (e.g. Alzheimer's disease, dementia, schizophrenia) disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the central nervous, 
hematopoietic and immune systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, lymphoid tissue, and brain and other tissue of the nervous system, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in the intervention or detection of pathologies 
associated with the hematopoietic and immune systems, such as anemias (leukemias). 
In addition, the expression in brain (including fetal) might suggest a role in 
developmental brain defects, neuro-degenerative diseases or behavioral abnormalities 
(e.g. schizophrenia, Alzheimer's, dementia, depression, etc.). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in lung, and to a lesser extent in a variety of 
other hematological cell types (e.g. Raji cells, bone marrow cell line, activated 
monocytes). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary and/or 
hematological dysfunction. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the vasculo-pulmonary and hematopoietic systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., lung and blood cells, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in the intervention and detection of pathologies 
associated with the vasculo-pulmonary system. In addition, the expression of this gene 
in a variety of leukocytic cell types and a bone marrow cell line might suggest a role in 
hematopoietic and immune system disorders, such as leukemias & lymphomas, 
inflammation, immunodeficiencies and autoimmunities. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 

The translation product of this gene shares sequence homology with adenylate 
kinase isozyme 3 (gi|163528 GTP:AMP phosphotransferase (EC 2.7.4.10) [Bos 
taurus]), which is thought to be important in catalyzing the phosphorylation of AMP to 
ADP in the presence of ATP or inorganic triphosphate. 

This gene is expressed primarily in fetal liver, heart and placenta, and to a lesser 
extent in many other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatic, cardiovascular 
or reproductive disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic, cardiovascular and reproductive systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., liver, heart, and placenta, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of conditions 
related to hepatic function and pathogenesis, in particular, those dealing with liver 
development and the differentiation of hepatocyte progenitor cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

This gene is expressed primarily in CD34 positive cells (Cord Blood). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematopoietic 
differentiation and immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of hematopoietic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., hematopoietic cells, and blood cells, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in the detection and treatment of conditions 
associated with CD34-positive cells, and therefore as a marker for cell differentiation in 
hematopoiesis, as well as immunological disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

The translation product of the predicted open reading frame of this contig has 
sequence identity to the murine gene designated Insulin-Like Growth Factor-Binding 
Protein (IGFBP)-1 as described by Lee and colleagues (Hepatology 19 (3), 656-665 
(1994)). 

This gene is expressed exclusively in hemangiopericytoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of hemangiopericytoma and other pericyte or 
endothelial cell proliferative disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the circulatory and immune systems, expression of this 
gene at significantly higher or lower levels may routinely be detected in certain tissues 
and cell types (e.g., pericyte or endothelial cells, and liver, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Polynucleotides and polypeptides corresponding to this gene are useful as cell 
growth regulators since IGFBP-1-like molecules function as modulators of insulin-like 
growth factor activity. In addition, since IGFBP-1 is expressed at high levels 
following hepatectomy and during fetal liver development, polynucleotides of the 
present invention may also be used for the diagnosis of developmental disorders. 
Further, polypeptides of the present invention may be used therapeutically to treat 
developmental liver disorders as well as to regulate hepatocyte and supporting cell 
growth following hepatectomy or to treat liver disorders. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of 
hemangiopericytoma and liver disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

This gene is expressed primarily in schizophrenic frontal cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: nervous system and 
cognitive disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the frontal cortex and CNS, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, treatment and diagnosis of frontal 
cortex, neuro-degenerative and CNS disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This gene is expressed primarily in human adrenal gland tumor, and to a lesser 
extent in human kidney, medulla and adult pulmonary tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic, endocrine 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly 
endocrine and nervous system disorders and neoplasia, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., adrenal gland, kidney, brain and other tissue of the nervous system, 
pulmonary tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, treatment and diagnosis of neurological 
and endocrine disorders including neoplasia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

This gene is expressed primarily in human adipocytes, and to a lesser extent in 
spleen, 12-week old human, and testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune, metabolic and 
growth disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., adipocytes, 
spleen, and testes and other reproductive tissue, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis and treatment of immune, 
developmental and metabolic disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

One translated product of this clone is homologous to the mouse zinc finger protein 
PZF. (See Accession No. 453376; see also Gene 152 (2), 233-238 (1995).) Preferred 
polypeptide fragments correspond to the highly conserved domains shared between 
mouse and man. For example, preferred polypeptide fragments comprise the amino acid 
sequence: UK:mCGFTCRQKASUWHMKKHDADSFYQFS^ 

PEV (SEQ ID NO: 621); ITSTDILGTNPESLTQPSD (SEQ ID NO: 622); NSTSGECLLLEAEGM 
SKSY (SEQ ID NO: 623); CSGTERVSLMAIXJKIFVGSGSSGGTEGLVMNSDIl^ATre 

10 SAG? (SEQ ID NO: 624); IQYVRCEMEGCGTVLAHPRYLQHmiCYQHLLKKKYVCPHrera 

RLQKQLLRHAKHHT (SEQ ID NO: 625); IXJRDYICEYCARAFKSSHNlA\ramiHTGEK (SEQ 
ID NO: 626); RSSRTSVSRHRDTENTRSSRSKTGSLQLICKSEPNTDQLDY (SEQ ID NO: 627); 
PFKDDPRDETYKFHLERETPKPRRKSG (SEQ ID NO: 630); QYVRCEMEGCGTVLAHPRYLQ 
HfflKYQHLLKKKYVCPHPSCGRLFRLQKQLLRHAKHHTD (SEQ ID NO: 629); or residues 

15 151-182ofQRDYICEYCARAFKSSHNLAVHRMIHTGEKHY(SEQmNO:^^^^^ Also preferred 
are polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in rhabdomyosarcoma, melanocyte and colon 
cancer tissue and to a lesser extent in smooth muscle, pancreatic tumor, and apoptotic 
T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to.. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and hemopoietic systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., striated muscle, 
melanocytes, colon, smooth muscle, pancreas, and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis and treatment of cancer and 
hemopoietic disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

This gene is expressed primarily in human adipose and salivary gland tissue and 
to a lesser extent in human bone marrow and fetal kidney. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic and immune 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the metabolic and hemopoietic systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., adipose, 
salivary gland, bone marrow, and kidney, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and diagnosis of metabolic and immune 
disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 
The translated product of this gene was recently identified as oxytocinase splice variant 
1. (See Accession Nos. 2209276 and dl010078.) Preferred polypeptide fragments 
comprise the amino acid sequence: EMFDSl^YFKGSSLLLMLKTYLSEDVFQHAVVLYLHN 
HSYASIQSDDLWDSFNEVTNQTLDVKRl^iaWnXJKGFPLVTVQKKG^ 
PHQPSimiYM (SEQ ID NO: 631). Also preferred are polynucleotide fragments encoding 
this polypeptide fragment. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

This gene is expressed primarily in hemopoietic cells, particularly apoptotic T-cells, 
and to a lesser extent in primary dendritic cells and adipose tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of apoptotic T-cells, primary dendritic cells, and 
adipose tissue present in a biological sample and for diagnosis of diseases and 
conditions: hemopoietic diseases including cancer and general immune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the oral 
and intestinal mucosa as well as hemopoietic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., hematopoietic cells, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of diseases of the immune system, 
including cancer, hemopoietic and infectious diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

This gene is expressed primarily in kidney cortex and to a lesser extent in infant 
brain, heart, uterus, and blood. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of kidney tissue present in a biological sample and 
for diagnosis of diseases and conditions: soft tissue cancer, inflammation, kidney 
fibrosis. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
nervous and endocrine systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., kidney, 
brain, and other nervous tissue, heart, uterus, and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and treatment of cancer and fibroses. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The translation product of this gene shares strong sequence homology with 
vertebrate and invertebrate protein tyrosine phosphatases. 
This gene is expressed primarily in endometrial tumors, melanocytes, myeloid 
progenitors and to a lesser extent in infant brain, adipocytes, and several hematopoietic 
stem cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of transformed hematopoietic and epithelial cells 
present in a biological sample and for diagnosis of diseases and conditions which 
include, but are not limited to, of skin and endometrium, leukemia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the nervous and 
hemopoietic systems, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., endometrium, melanocytes, 
bone marrow, adipocytes, hematopoietic cells, and brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and sequence similarity with tyrosine phosphatases 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
study and treatment of cancer and hematopoietic disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

This gene is expressed primarily in osteoclastoma, breast, and infant brain and 
to a lesser extent in various fetal and transformed bone, ovarian, and neuronal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: degenerative conditions
of the brain and skeleton. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous and skeletal systems, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
bone, mammary tissue, and brain and other tissue of the nervous system, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for study and treatment of degenerative,
neurological and skeletal disorders.
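
The comparison described above, in which a sample's expression level is measured against the standard level from healthy tissue or bodily fluid, can be sketched as a simple fold-change calculation. This is an illustrative sketch only: the function name, the twofold threshold, and the example values are assumptions and are not taken from the specification, which does not define what counts as "significantly" higher or lower.

```python
import math


def classify_expression(sample_level: float, standard_level: float,
                        fold_threshold: float = 2.0) -> str:
    """Compare a sample's expression level to the standard (healthy) level.

    fold_threshold is a hypothetical cutoff chosen for illustration; a real
    assay would set it from replicate measurements and a statistical test.
    """
    if sample_level <= 0 or standard_level <= 0:
        raise ValueError("expression levels must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")

    # Work in log2 space so that higher and lower changes are symmetric.
    log2_fold = math.log2(sample_level / standard_level)
    cutoff = math.log2(fold_threshold)

    if log2_fold >= cutoff:
        return "higher"
    if log2_fold <= -cutoff:
        return "lower"
    return "unchanged"
```

A sample would be called "higher" or "lower" only when it differs from the standard by at least the assumed fold threshold in either direction.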

FEATURES OF PROTEIN ENCODED BY GENE NO: 71

This gene was originally cloned from tumor cell lines. Recently another group 
has also cloned this gene, calling it the human malignant melanoma metastasis- 
suppressor (KiSS-1) gene. (See Accession No. U43527.) Preferred polypeptide 
fragments comprise the amino acid sequence: LEKVASVGNSRPTGQQLESLGLLA (SEQ ID
NO: 632); VHREEASCYCQAEPSGDL (SEQ ID NO: 633); RPALRQAGGGTREPRQKRWAGL
(SEQ ID NO: 634); and AVNFRPQRSQSM (SEQ ID NO: 635). Any frame shifts can easily be
resolved using known molecular biology techniques.
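
As an illustration of how the preferred fragments listed above might be handled computationally, the following sketch stores them keyed by SEQ ID NO and reports which of them occur in a candidate polypeptide sequence. The helper name and the candidate sequence used in the usage note are hypothetical and are not part of the specification; only the four fragment sequences are taken from the text.

```python
# Preferred polypeptide fragments from the text, keyed by SEQ ID NO.
FRAGMENTS = {
    632: "LEKVASVGNSRPTGQQLESLGLLA",
    633: "VHREEASCYCQAEPSGDL",
    634: "RPALRQAGGGTREPRQKRWAGL",
    635: "AVNFRPQRSQSM",
}

# One-letter codes of the 20 standard amino acids.
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")


def find_fragments(candidate: str) -> dict[int, int]:
    """Return {SEQ ID NO: 0-based start index} for each fragment found.

    `candidate` is any polypeptide sequence in one-letter code; fragments
    that do not occur in it are omitted from the result.
    """
    hits = {}
    for seq_id, fragment in FRAGMENTS.items():
        position = candidate.find(fragment)
        if position != -1:
            hits[seq_id] = position
    return hits
```

For example, calling `find_fragments` on a made-up sequence formed by placing the SEQ ID NO: 635 fragment after two arbitrary residues would report that fragment at index 2 and nothing else.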

This gene is expressed primarily in many types of carcinomas and to a lesser 
extent in many normal organs. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, particularly melanomas, and other hyperproliferative disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of
transformed organ tissue, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. As a tumor suppressor gene, increased
amounts of the polypeptide can be used to treat patients having a particular cancer.

The tissue distribution indicates that this gene and the translated product are
useful for diagnosis and study of cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

This gene is expressed primarily in striatum and to a lesser extent in adipocytes
and hemangiopericytoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of striatal cells present in a biological sample and
for diagnosis of diseases and conditions: neurological, fat and lysosomal storage
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the nervous and immune systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., striatal
tissue, adipocytes, and vascular tissue, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis, study and treatment of 
neurodegenerative and growth disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 73

This gene is expressed primarily in bone marrow stromal cells and to a lesser 
extent in smooth muscle, testes, endothelium, and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of bone marrow present in a biological sample and
for diagnosis of diseases and conditions: connective tissue and hematopoietic diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
skeletal and hematopoietic systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., bone
marrow, stromal cells, smooth muscle, testes and other reproductive tissue,
endothelium, brain and other tissue of the nervous system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis, and treatment of connective 
tissue and blood diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 74

This gene is expressed primarily in brain, fetal liver and lung and to a lesser
extent in retina, spinal cord, activated T-cells and endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of brain and regenerating liver present in a
biological sample and for diagnosis of diseases and conditions: CNS and spinal cord
injuries, immune disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous and immune systems, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
brain and other tissue of the nervous system, liver, pulmonary tissue, blood cells, and
endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and treatment of hematopoietic and 
neurological conditions.

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares sequence homology with GTP 
binding proteins (intracellular). 

This gene is expressed primarily in bone marrow, brain, and melanocytes and to
a lesser extent in various endocrine and hematopoietic tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hematopoietic and
nervous system conditions. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous and immune systems, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
bone marrow, melanocytes, brain and other tissue of the nervous system, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to nucleotide binding factors indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for study, 
diagnosis, and treatment of brain degenerative, skin and blood diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

This gene is expressed primarily in activated T-cells and to a lesser extent in 
retina, brain, and fetal bone. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of activated T-cells and developing brain present
in a biological sample and for diagnosis of diseases and conditions: immune
deficiencies and skeletal and neuronal growth disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the nervous, immune, and skeletomuscular
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., blood cells, brain and other tissue of the
nervous system, retinal tissue, and bone, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis, study and treatment of cancer,
urogenital, and brain degenerative diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

This gene is expressed primarily in fetal liver, activated monocytes, and osteoblasts,
and to a lesser extent in synovial, brain, and lymphoid tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of myeloid and lymphoid cells present in a biological
sample and for diagnosis of diseases and conditions: inflammation, immune
deficiencies, cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system and skeleton, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
liver, blood cells, bone, synovial tissue, brain and other tissue of the nervous system,
and lymphoid tissue, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis, and treatment of lymphoid 
and mesenchymal cancers and nervous system diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The translation product of this gene shares sequence homology with polymerase
polyprotein precursor, which is thought to be important in DNA repair and replication.

This gene is expressed primarily in infant brain and to a lesser extent in tumors
and tumor cell lines.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, those of the neural system and developing organs. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the neural system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having such
a disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to polymerase polyprotein precursor
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and treatment of cancers, especially of the neural system and developing
organs.

FEATURES OF PROTEIN ENCODED BY GENE NO: 79

This gene is expressed primarily in muscle and endothelial cells and to a lesser
extent in brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: vascular diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
vascular system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., muscle, endothelial cells, brain
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of disorders of the
vascular and neural systems, including cardiovascular and endothelial disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

This gene is expressed primarily in placenta and to a lesser extent in fetal liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: developmental disorders
and disorders of the haemopoietic system, fetal liver and placenta. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly developmental
disorders and disorders of the haemopoietic system, fetal liver and placenta, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., placenta and liver, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of developmental
disorders and disorders of the haemopoietic system, fetal liver and placenta.

FEATURES OF PROTEIN ENCODED BY GENE NO: 81

This gene is expressed primarily in bone marrow, placenta and tissues and 
organs of the hematopoietic system.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: disorders of the bone
and haemopoietic system. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune, bone and hematopoietic systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., bone marrow, placenta, and hematopoietic cells, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of disorders of the
immune, bone and hematopoietic systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

The translation product of this gene shares sequence homology with secretory
carrier membrane protein, which is thought to be important in protein transport and
export. Any frame shifts in coding sequence can be easily resolved using standard
molecular biology techniques. Another group recently cloned this gene, calling it
SCAMP. (See Accession No. 2232243.)

This gene is expressed primarily in prostate, breast and spleen, and to a lesser 
extent in several other tissues and organs. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: disorders of the breast,
prostate and spleen. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly disorders of the breast, prostate and spleen, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., prostate, mammary tissue, and spleen, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to secretory carrier membrane protein 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of disorders of the breast, prostate and spleen. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 83

This gene is expressed primarily in developing organs and tissue like placenta 
and infant brain and to a lesser extent in developed organs and tissue like cerebellum 
and heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurological diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
neural system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., placenta, heart, brain and other
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of diseases of the 
neural system, including neurological disorders and cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of this gene shares sequence homology with ATPase 6 
in Trypanosoma brucei which is thought to be important in metabolism. 

This gene is expressed primarily in tumor and fetal tissues and to a lesser extent 
in melanocytes, kidney cortex, monocytes and ovary.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: metabolism disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the fetal
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., fetal tissues, melanocytes, kidney, blood
cells, ovary and other tissue of the reproductive system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to ATPase indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for treatment and diagnosis of 
metabolism disorders, especially in fetal and tumor tissue growth.

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this gene shares sequence homology with the 
immunoglobulin superfamily of proteins which are known to be important in immune
response and immunity. 

This gene is expressed primarily in stromal cells, colon cancer, lung, amygdala,
and melanocytes, and to a lesser extent in a variety of other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects of stromal cell
development and cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the stromal cells, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., stromal cells,
colon, lung, amygdala, and melanocytes, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to immunoglobulin indicate that
polynucleotides and polypeptides corresponding to this gene are useful for treatment
and diagnosis of immune system disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 86

The translation product of this gene shares sequence homology with
transcription initiation factor eIF-4 gamma, which is thought to be important in gene
transcription.

This gene is expressed primarily in tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumorigenesis.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly in tumor
tissues, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., endometrium and lung, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to transcription initiation factor eIF-4
gamma indicate that polynucleotides and polypeptides corresponding to this gene are
useful for gene regulation in tumorigenesis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

The translation product of this gene shares sequence homology at low level in
prolines with secreted basic proline-rich peptide II-2, which is thought to be important in
protein structure or inhibiting hydroxyapatite formation in vitro.

This gene is expressed primarily in endometrial tumor and fetal lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: endometrial tumors.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
muscular/skeletal and reproductive systems, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
endometrium, and lung, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to secreted basic proline-rich peptide II-2
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
inhibiting hydroxyapatite formation or establishing cell/tissue structure.

FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

This gene is expressed primarily in: amniotic cells induced with TNF in culture;
and to a lesser extent in colon tissue from a patient with Crohn's Disease; parathyroid
tumor; activated T-cells; cells of the human Caco-2 cell line; adenocarcinoma; colon;
corpus callosum; fetal kidney; pancreas tumor; fetal brain; early stage brain; and anergic
T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system
(e.g., tumors), expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., amniotic cells, colon, kidney,
pancreas, parathyroid, brain and other tissue of the nervous system, blood cells,
hematopoietic cells, liver, spleen, bone, testes and other reproductive tissue, and
epithelial cells, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful
for modulating tumorigenesis and other immune system conditions such as disorders in
immune response.

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

This gene is expressed primarily in fetal liver/spleen and hematopoietic cells and
to a lesser extent in brain, osteosarcoma, and testis tumor.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: leukemia and
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hematopoietic and immune systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., hematopoietic cells, liver, spleen, bone, testes, and other reproductive
tissue, brain and other tissue of the nervous system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of hematopoietic and
immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

The translation product of this gene shares weak sequence homology with 
mouse Gcapl protein which is developmentally regulated in brain. 

This gene is expressed primarily in infant and adult brain and fetal liver/spleen
and to a lesser extent in smooth muscle, T cells, and a variety of other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological or
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous, hematopoietic, immune, and endocrine systems, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., brain and other tissue of the nervous system, blood cells,
liver, spleen, and smooth muscle, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and its homology to Gcapl protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for treatment and
diagnosis of disorders in neuronal, hematopoietic, immune, and endocrine systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 91

This gene is expressed primarily in brain and hematopoietic cells and to a lesser 
extent in tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: disorders of the nervous,
hematopoietic, and immune systems and tumorigenesis. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the nervous, hematopoietic, and immune
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of disorders in the nervous, hematopoietic, and immune 
systems. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with 
neuroendocrine-specific protein A which is thought to be important in neurologic 
systems. 

This gene is expressed primarily in brain tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neural disorders and
degeneration disease. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the central or peripheral nervous systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., hematopoietic cells, and brain and other tissue of the nervous system, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to neuroendocrine-specific protein A 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment or diagnosis of neural disorders and degeneration disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

The translation product of this gene shares sequence homology with
collagen-like protein and proline-rich protein which are thought to be important in
connective tissue function and tissue structure.

This gene is expressed primarily in fetal liver/spleen and brain tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuronal or
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous and hematopoietic systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., liver, spleen, and brain and other tissue of the nervous system, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to collagen-like protein and proline-rich
proteins indicate that polynucleotides and polypeptides corresponding to this gene are
useful for supporting brain and hematopoietic tissue function and diagnosis and
treatment of disorders in these functions.

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

This gene is expressed primarily in embryonic tissues and tumor tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system (e.g., tumors), expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic
tissue and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95

This gene is expressed primarily in brain tumor, placenta, and melanoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: brain tumor or
melanoma. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the brain or melanocytes, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of
the nervous system, placenta, and melanocytes, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the translation product of this gene is useful
in the diagnosis and treatment of brain tumors and melanoma.



FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

The translation product of this gene shares sequence homology with a yeast
membrane protein, SUR4, which encodes for APA1 that acts on a glucose-signaling
pathway that controls the expression of several genes that are transcriptionally regulated
by glucose.

This gene is expressed primarily in fetal liver, and to a lesser extent in placenta
and breast tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects of fetal liver or
defects of glucose-regulated ATPase activities in tissues. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the fetal immune/hematopoietic system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, placenta, and mammary tissue, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to yeast SUR4 membrane protein indicate 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of defects of fetal liver or defects of glucose-regulated ATPase
activities. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

This gene is expressed primarily in fetal liver, brain, and amniotic fluid.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects of the fetal
immune system and adult brain. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the fetal immune system and adult brain, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., liver, and brain and other tissue of the nervous system, and cancerous and
wounded tissues) or bodily fluids (e.g., amniotic fluid, serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for detecting defects of the fetal immune and hematopoietic systems since fetal liver is
the predominant organ responsible for hematopoiesis in the fetus. In addition, the gene
product of this gene is thought to be useful for detecting certain neurological defects of
the brain. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 98

The translation product of this gene shares sequence homology with a yolk
protein precursor, vitellogenin, which is thought to be important in binding lipids such
as phosvitin.

This gene is expressed primarily in amniotic cells and fetal liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects in amniotic
cells, fetal liver development and the fetal immune system. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the [insert system where a related disease
state is likely, e.g., immune], expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., amniotic cells,
and liver, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to vitellogenin indicate that the protein 
product of this clone is useful for treatment and diagnosis of defects in amniotic cells,
fetal liver development and the fetal immune system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 99 

This gene is expressed primarily in placenta, endometrial tumor, osteosarcoma
and stromal cells.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumor of the
endometrium or bone, and osteosarcoma. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the obstetric system (e.g., placenta,
endometrium) and the bones, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., placenta,
endometrium, bone, and stromal cells, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors and 
abnormalities of the endometrium, and the bones because of its abundance in the
aforementioned tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 100 

This gene is expressed primarily in hepatocellular tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hepatocellular tumor.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the liver,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of hepatocellular cancer because of its abundant expression 
in this tissue. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 101 

This gene is expressed primarily in corpus callosum, fetal lung and infant
brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects of the corpus
callosum or defects of the fetal lung. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the corpus callosum and brain in general, and fetal lung,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., lung, and brain and other tissue of the nervous
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein product of this clone is useful
for diagnosis and treatment of defects of the corpus callosum and brain in general, and
defects of fetal lung.

FEATURES OF PROTEIN ENCODED BY GENE NO: 102

This gene is expressed primarily in T cells and stromal cells, and to a lesser 
extent in adrenal gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: defects of T cell
immunity and stromal cell development. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, stromal cells, and adrenal gland, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of defects of T cell immunity and stromal cell development 
because of its abundant expression in these tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 103

This gene is expressed primarily in infant brain and placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects of the brain and
nervous system. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous system, especially brain, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., brain and other tissue of the nervous system, and placenta, cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for detecting defects of the brain, especially in young children.

FEATURES OF PROTEIN ENCODED BY GENE NO: 105 

This gene is expressed primarily in human osteoclastoma and to a lesser extent
in human pancreas tumor.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, particularly osteoclastoma and pancreatic tumor. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly in transformed tissues,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., bone and pancreas, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful
for diagnosis and treatment of some types of tumors, particularly pancreatic cancer and
osteoclastoma.



FEATURES OF PROTEIN ENCODED BY GENE NO: 106

This gene is expressed primarily in fetal liver/spleen, and to a lesser extent in 
activated T-Cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., liver, spleen, and blood cells,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 107

This gene is expressed primarily in human embryo and to a lesser extent in 
spleen and chronic lymphocytic leukemia.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: leukemia. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune or
hemopoietic systems, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., embryonic tissue, spleen,
and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein product of this clone is useful 
for the diagnosis and treatment of leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 108

This gene is expressed primarily in placenta, and to a lesser extent in early stage 
human brain and in lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: fetal developmental
abnormalities. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly in
fetal and amniotic tissue, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., placenta, brain and
other tissue of the nervous system, and lung, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful for
production of growth factor(s) associated with fetal development. Preferred
polypeptides comprise the full-length polypeptide shown in the sequence listing,
truncated, however, at the amino terminus and beginning with QTIE.

FEATURES OF PROTEIN ENCODED BY GENE NO: 109 

This gene is expressed primarily in fetal spleen, and to a lesser extent in B-Cell
lymphoma and T-Cell lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: lymphoma. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., spleen and blood cells, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful
for the treatment and diagnosis of human lymphomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 110

The translation product of this gene shares sequence homology with sarcoma
amplified sequence (SAS), a tetraspan receptor which is thought to be important in
malignant fibrous histiocytoma and liposarcoma.

This gene is expressed primarily in human osteoclastoma, and to a lesser extent 
in pineal gland and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: malignant fibrous
histiocytoma and liposarcoma. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., bone,
pineal gland, and brain and other tissue of the nervous system, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to sarcoma amplified sequence (SAS)
indicate that the protein product of this clone is useful for treatment of osteosarcoma,
malignant fibrous histiocytoma and liposarcoma and related cancers, particularly
sarcomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 111 

The translation product of this gene shares sequence homology with 6.8K 
proteolipid protein, mitochondrial - bovine.

This gene is expressed primarily in Wilms' tumor and to a lesser extent in
cerebellum and placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Wilms' tumor.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune or renal systems, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of
the nervous system, and placenta, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to 6.8K proteolipid protein indicate that
the protein product of this clone is useful for diagnostics and therapeutics associated with
tumors, particularly Wilms' tumor.

FEATURES OF PROTEIN ENCODED BY GENE NO: 112 

This gene is expressed primarily in embryonic tissue and to a lesser extent in
osteoblasts, endothelial cells, macrophages (GM-CSF treated), and bone marrow.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., embryonic tissue, bone,
endothelial cells, blood cells and bone marrow, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment or diagnosis of immune disorders.
Preferred polypeptides encoded by this gene comprise the following amino acid
sequence: MTTOVQLAIFANML^VSmiVVLYHYVAVNNPK^ (SEQ ID NO: 636).

FEATURES OF PROTEIN ENCODED BY GENE NO: 113 

This gene is expressed primarily in hepatocellular tumor, and to a lesser extent 
in fetal liver/spleen.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors, particularly
hepatocellular tumors. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hepatic system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., liver, and
spleen, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein product of this clone is useful
for diagnosis and treatment of tumors, particularly hepatocellular tumors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 114 

The translation product of this gene exhibits a very high degree of sequence
identity with the human Pig8 gene, which is thought to be important in p53-mediated
apoptosis. The sequence of this gene has since been published by Polyak and
colleagues (Nature 389:300-306 (1997)). In addition, the predicted translation product
of this contig exhibits very high sequence homology with a murine gene denoted as
EI24, which is also thought to be important in p53-mediated apoptosis.

This gene is expressed primarily in infant brain and activated T-cells and to a
lesser extent in bone marrow, fetal liver, and prostate.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, tissue damage by radiation and anti-cancer drugs. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the nervous and
immune systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., brain and other tissue of the
nervous system, blood cells, bone marrow, liver, and prostate, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to human Pig8 and murine EI24 genes
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
preventing apoptosis in patients being treated with anti-oncogenic treatments such as
etoposide, hydroperoxycyclophosphamide, and X-irradiation, since this protein product
is upregulated in cells undergoing such treatment where p53 was overexpressed. It may
also be useful in the treatment of hematopoietic disorders and in boosting numbers of
hematopoietic stem cells by interfering with the apoptosis of progenitor cells. The
mature polypeptide is predicted to comprise the following amino acid sequence:
EEMADSVKTFLQDLARGtKDSrWGICTISKLDARIQQKREEQRRRR^
EPRTVSRIFQCCAWNGGVFWSLIIJ^RVFIP^^
LWVLPmO^KVVNAIWFQDIADIJVFEVSGRKPHP
FPIHLVGQLVSLLHMSLLYSLYCFEYRWFNKGIEMHQRI^hnERhn^
SSYnSGCLFSIUTLniSANEAKTPGKAYLFQUUJ^LVVH^
FPSPHPSPAKLKATAGH (SEQ ID NO: 637). Accordingly, polypeptides comprising the
foregoing amino acid sequence are provided, as are polynucleotides encoding such
polypeptides.

FEATURES OF PROTEIN ENCODED BY GENE NO: 115

This gene is expressed primarily in stromal cells and to a lesser extent in 
multiple sclerosis. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions affecting the nervous
system. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
nervous system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., stromal cells and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of multiple sclerosis
and other autoimmune diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 116

This gene is expressed primarily in the gall bladder.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: gall stones or infection
of the digestive system. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the digestive system or renal system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., gall bladder and tissue of the digestive system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for possible prevention of digestive disorders 
where there may be a lack of digestive enzymes produced or in the detection and 
possible prevention of gall stones. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 117 

The translation product of this gene shares sequence homology with the dystrophin
gene, which is thought to be important in the building and maintenance of muscles.

This gene is expressed primarily in placenta and to a lesser extent in fetal brain,
fetal liver, and spleen.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: muscular dystrophy,
Duchenne and Becker's muscular dystrophies. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the skeletal muscle system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
and cell types (e.g., placenta, brain and other tissue of the nervous system, muscle,
liver, and spleen, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to the dystrophin gene indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diseases
related to the degenerative myopathies that are characterized by the weakness and atrophy
of muscles without neural degradation, such as Duchenne and Becker's muscular
dystrophies.

FEATURES OF PROTEIN ENCODED BY GENE NO: 118

This gene is expressed primarily in olfactory tissue and to a lesser extent in 
cartilage. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: connective tissue
diseases; chondrosarcoma. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the connective tissue, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., olfactory
tissue and cartilage, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for tumors of connective tissues, osteoarthritis
and the treatment and diagnosis of chondrosarcoma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 119

This gene is expressed primarily in activated neutrophils and to a lesser extent
in fetal spleen and CD34 positive cells from cord blood.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: allergies, defects in
hematopoiesis and inflammation. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system and the hematopoietic system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., blood cells, and spleen, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for reducing the allergic effects felt by allergy
sufferers by neutralizing the activity of the immune system, especially since neutrophils
are abundant in persons suffering from allergies and other inflammatory conditions.

FEATURES OF PROTEIN ENCODED BY GENE NO: 120

The translation product of this gene shares sequence homology with poly A
binding protein II which is thought to be important in RNA binding for transcription of
RNA to DNA.

This gene is expressed primarily in colon and to a lesser extent in brain and
immune system.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: colon cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune and
digestive system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., colon, tissue and cells of the
immune system, and brain or other tissue of the nervous system, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to poly A binding protein II indicate that
polynucleotides and polypeptides corresponding to this gene are useful for detection
and treatment of colon cancer and other disorders of the digestive system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 121

The translation product of this gene shares sequence homology with thymidine
diphosphoglucose 4,6-dehydrase which is thought to be important in the metabolism of
sugar.

This gene is expressed primarily in fetal liver and spleen and to a lesser extent in
infant brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diabetes. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the endocrine system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, spleen, and brain and other tissue of the
nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to thymidine diphosphoglucose 4,6-dehydrase
indicate that polynucleotides and polypeptides corresponding to this gene are
useful for treatment of persons with diabetes since it appears that this protein is needed
in the metabolism of sugar into its more basic components.

FEATURES OF PROTEIN ENCODED BY GENE NO: 122

The translation product of this gene shares sequence homology with
ceruloplasmin which is thought to be important in the metabolism and transport of iron
and copper. Ceruloplasmin also contains domains with homology to clotting factors V
and VIII. Defects in the circulating levels of ceruloplasmin (aceruloplasminemia) have
been associated with certain disease conditions such as Wilson disease, and the
accompanying hepatolenticular degeneration.

This gene is expressed primarily in brain and retina and to a lesser extent in
endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases marked by
defects in iron metabolism; aceruloplasminemia not characterized by defects in the
known ceruloplasmin gene locus; nonclassical Wilson disease; movement disorders;
and tumors derived from a brain tissue origin. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the brain, retina, and nervous system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissue of the nervous system,
retinal tissue, and endothelial cells, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to ceruloplasmin indicate that
polynucleotides and polypeptides corresponding to this gene are useful for treatment of
patients with aceruloplasminemia, or other defects in iron and/or copper metabolism.
Mutations in this locus could also be diagnostic for patients currently experiencing or
predicted to experience aceruloplasminemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 123 

This gene is expressed primarily in brain and B cell lymphoma and to a lesser
extent in fetal liver and spleen.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: B cell lymphoma;
tumors and diseases of the brain and/or spleen; hematopoietic defects. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the brain and
hematopoietic system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of
the nervous system, blood cells, liver, and spleen, and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment of disorders in neuronal,
hematopoietic, and immune systems. It could potentially be useful for
neurodegenerative disorders and neuronal and/or hematopoietic cell survival or
proliferation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 124

This gene is expressed primarily in osteoclastoma, dermatofibrosarcoma, and B
cell lymphoma and to a lesser extent in endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, in particular osteoclastoma, dermatofibrosarcoma, and B cell
lymphoma. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the bone, immune, and circulatory system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
bone, epidermis, blood cells, and endothelial cells, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancers and 
lymphoma; osteoporosis; and the control of cell proliferation and/or differentiation. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 125 

This gene is expressed primarily in immune tissues and hematopoietic cells, 
particularly in activated T cells and neutrophils, spleen, and fetal liver, and to a lesser 
extent in infant adrenal gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: defects in T cell
activation; hematopoietic disorders; tumors of a hematopoietic and/or adrenal gland
origin. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
hematopoietic and/or endocrine systems, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues and cell types (e.g., cells and
tissues of the immune system, hematopoietic cells, blood cells, liver, and adrenal gland,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for immune and/or hematopoietic disorders;
diseases related to proliferation and/or differentiation of hematopoietic cells; defects in T
cell and neutrophil activation and responsiveness; and endocrine and/or metabolic
disorders, particularly of early childhood.

FEATURES OF PROTEIN ENCODED BY GENE NO: 126 

This gene is expressed primarily in placenta and endothelial cells and to a lesser
extent in melanocytes and embryonic tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial
cell origin; angiogenesis associated with tumor development and metastasis. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the vascular system
and developing embryo, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., placenta, endothelial
cells, melanocytes, and embryonic tissues, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of developmental disorders; 
inhibition of angiogenesis; and vascular patterning. 






FEATURES OF PROTEIN ENCODED BY GENE NO: 127 

This gene is expressed primarily in endothelial cells and hematopoietic tissues, 
including spleen, tonsils, leukocytes, and both B- and T-cell lymphomas.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial
cell and/or hematopoietic origin; leukemias and lymphomas. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the immune and vascular systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., endothelial cells, hematopoietic cells, spleen,
tonsils, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the manipulation of angiogenesis; the
differentiation and morphogenesis of endothelial cells; the proliferation and/or
differentiation of hematopoietic cells; and the commitment of hematopoietic cells to
distinct cell lineages.


FEATURES OF PROTEIN ENCODED BY GENE NO: 128 

This gene is expressed primarily in kidney medulla and to a lesser extent in
spleen from chronic myelogenous leukemia patients, prostate cancer, and some other
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors of a kidney
origin; chronic myelogenous leukemia; prostate cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the kidney and spleen, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
and cell types (e.g., kidney, spleen, and prostate, and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.



The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of kidney
disorders and cancer, particularly chronic myelogenous leukemia and prostate cancer.
It may also be useful for the enhancement of kidney tubule regeneration in the treatment
of acute renal failure.

FEATURES OF PROTEIN ENCODED BY GENE NO: 129 

This gene is expressed primarily in adult and infant brain and to a lesser extent 
in mesenchymal or fibroblast cells, as well as tissues with a mesenchymal origin. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tumors of a brain and/or
mesenchymal origin; neurodegenerative disorders; cancer; fibrosis. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the brain and of
mesenchymal cells and tissues, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., brain and other
tissue of the nervous system and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis of tumors of a brain and/or
mesenchymal origin; neurodegenerative disorders; cancer; and fibrosis, based upon the
expression of this gene within those tissues. Fibrosis is included because mesenchymal
cells and fibroblasts are the primary cellular targets involved in this pathological
condition.


FEATURES OF PROTEIN ENCODED BY GENE NO: 130 

This gene is expressed primarily in hepatocellular cancer and to a lesser extent in
fetal tissues as well as testes tumor.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: liver cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the digestive system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, fetal tissue, and testes and other
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of liver cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 131 
This gene is expressed only in infant early brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: development and
diseases of the nervous system. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the brain and nervous system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
brain and other tissue of the nervous system and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating diseases of the brain in children and in
treating nervous system disorders such as Alzheimer's disease, schizophrenia,
dementia, depression, etc.

FEATURES OF PROTEIN ENCODED BY GENE NO: 132 

This gene is expressed primarily in brain and to a lesser extent in glioblastoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: Alzheimer's disease,
schizophrenia, depression, mania, and dementia. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the brain and nervous system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., brain and other tissue of the nervous system, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating brain disorders such as Alzheimer's 
disease, schizophrenia, depression, mania, and dementia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 133

The translation product of this gene shares sequence homology with bacterial ribitol
dehydrogenase, which is thought to be important in the metabolism of sugars.

This gene is expressed primarily in macrophage and to a lesser extent in T-cell
lymphoma and lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: tissue destruction in
inflammation. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., blood cells and lung, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to ribitol dehydrogenase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for altering
macrophage metabolism in diseases such as inflammation where macrophages are
causing excess tissue destruction.

FEATURES OF PROTEIN ENCODED BY GENE NO: 134

This gene is expressed primarily in pancreatic tumor and to a lesser extent in 
synovial sarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the endocrine and connective tissue systems, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
pancreas, and synovial tissue, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating and diagnosing various cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 135

This gene is expressed primarily in T cell lines such as Raji and to a lesser 
extent in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune system
disorders and inflammation. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, and brain and other tissue of the nervous system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating and diagnosing inflammatory diseases
such as rheumatoid arthritis, sepsis, inflammatory bowel disease, and psoriasis, as well
as neutropenia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 136 

The translation product of this gene shares high sequence homology with the SAR1
subfamily of GTP-binding proteins, which is thought to be important in vesicular
transport in mammalian cells.

This gene is expressed primarily in serum-stimulated smooth muscle cells and to 
a lesser extent in a T-cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: diseases affecting
vesicular transport. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the muscular system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, and smooth muscle, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to GTP-binding proteins indicate that
polynucleotides and polypeptides corresponding to this gene are useful for gene therapy
in treating the large number of diseases involving defective vesicular transport within
cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 137 

The translation product of this gene shares sequence homology with a protein
found in C. elegans cosmid F25B5.

This gene is expressed primarily in fetal tissues and to a lesser extent in
melanocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: abnormal fetal
development, especially of the pulmonary system. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the fetal pulmonary system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., fetal tissue, pulmonary tissue, and melanocytes, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of diseases affecting
the pulmonary system, such as emphysema.

FEATURES OF PROTEIN ENCODED BY GENE NO: 138 

This gene is expressed primarily in gall bladder and to a lesser extent in smooth
muscle.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: digestive system disease
and gall bladder problems. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the digestive system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., gall
bladder and tissue of the digestive system, and smooth muscle, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating diseases of the digestive system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 139 

This gene is expressed primarily in placenta and to a lesser extent in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: abnormal fetal
development. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
developing tissues, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues and cell types (e.g., placenta, and brain and other
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating and diagnosing abnormal fetal
development.

FEATURES OF PROTEIN ENCODED BY GENE NO: 140 

This gene is expressed primarily in smooth muscle and to a lesser extent in
ovary, prostate cancer, and activated monocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hypertension and
atherosclerosis. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the circulatory system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., smooth
muscle, ovary and other reproductive tissue, prostate, and blood cells, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating diseases of the circulatory system,
such as hypertension, atherosclerosis, etc.

FEATURES OF PROTEIN ENCODED BY GENE NO: 141 

This gene is expressed primarily in fetal spleen and to a lesser extent in placenta
and bone marrow.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: anemia and other
diseases affecting blood cells. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the circulatory and pulmonary systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., spleen, placenta, bone marrow, and blood cells, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the generation of red and white blood cells and
for the diagnosis of disease of these cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 142 

The predicted translation product of this contig is a human homolog of the
murine tetracycline/sugar transporter molecule recently reported by Matsuo and
colleagues (Biochem. Biophys. Res. Commun. 238 (1), 126-129 (1997)).

This gene is expressed primarily in synovium and to a lesser extent in
endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: rheumatoid arthritis and
inflammation. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune and lymphatic systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., synovial
tissue, and endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of inflammatory
diseases, such as rheumatoid arthritis, leukemia, neutropenia, inflammatory bowel
disease, psoriasis, sepsis, and the like.


FEATURES OF PROTEIN ENCODED BY GENE NO: 143 

This gene is expressed primarily in placenta and to a lesser extent in melanocyte, 
fetal liver and spleen, and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: abnormal early
development. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., placenta, melanocytes, liver, spleen, and bone marrow, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of abnormal early
development phenomena and diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 144

This gene is expressed primarily in fetal liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: anemia and neutropenia.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune and blood systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues and cell types (e.g., liver and spleen,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful in hematopoiesis and bone marrow regeneration,
as it is most abundant in fetal tissues responsible for the generation of hematopoietic
cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 145 

The translation product of this gene shares sequence homology with protein
tyrosine phosphatase, which is thought to be important in transducing signals that
activate cells such as T cells, B cells and other cell types.

This gene is expressed primarily in T cells and tissues in early stages of 
development and to a lesser extent in cancers. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immuno-related
diseases and cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic
and fetal tissue, undifferentiated cells, and blood cells, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to the protein tyrosine phosphatase family
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
modulating the immune system.


FEATURES OF PROTEIN ENCODED BY GENE NO: 146 

This gene is expressed primarily in T cell and to a lesser extent in B cell, 
macrophages and tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immuno-disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., blood cells, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for regulating the immune system and can therefore
be used in treating diseases such as autoimmune diseases and cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 147 

This gene is expressed primarily in placenta and to a lesser extent in endothelial
cells, testis tumor, ovarian cancer, and uterine cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., placenta,
endothelial cells, testis and ovary and other reproductive tissue, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 148 

This sequence has significant homology to mouse torsin A. Recently, another
group cloned the human Torsin A gene. (See, Accession No. 2358279; see also Nature
Genet. 17, 40-48 (1997).)

This gene is expressed primarily in osteoclastoma, T-cell, and placenta and to a
lesser extent in fetal lung, fetal liver, fetal brain, adult brain and tumor tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: disease conditions in
hematopoiesis and cancers. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hematopoietic system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, bone, placenta, lung, liver, and brain and other tissues of the nervous system,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating blood related diseases such as
deficiencies in red blood cells, white blood cells, platelets and other hematopoietic cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 149 

This gene is expressed primarily in T cell, prostate and prostate cancer,
endothelial cells and to a lesser extent in monocyte, dendritic cell, bone marrow,
salivary gland, colon cancer, stomach cancer, pancreatic tumor, uterine cancer, fetal
spleen and osteoclastoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immuno-related
diseases and cancers. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, prostate, endothelial cells, dendritic cells, bone marrow, salivary gland, colon,
stomach, pancreas, uterus, spleen and bone, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment of cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 150 

This gene was recently cloned by another group, which named it eIF3-p66. (See
Accession No. 2351378.) This gene plays a role in RNA binding and macromolecular
assembly, and therefore any mutations in this gene would likely result in a diseased
phenotype. Preferred polypeptide fragments comprise the amino acid sequence:
MAKFKrrPVIQDM>SGWGPCAVPEQFRDMPYQPFSKGDRLGKA^ADW 

QFGGGSQYAYFHEEDESSFQLVDTARTQKTAYQRNRMRFAQRNUmDK^
KSAKQKm^ERIRUJKKFQKQFGVRQKWDQKSQKPRDSSVEVRS^ 
LEVSEPQDIECCGAlJBYYDKAFDRITTRSEmJOCXK^^ 
AI1J\T1JS4SCTRSWSWDIWQRVGSKIJTOKR^ 
AMEATYINHNFS<KK^LRMGKERYNFPNPNP 

EHIX3VXmiANGEVSFINIKTLJ^EWDSRHa^GW

CAIXAGSEYLKLGYVSRYHVKDSSRHVILGTQQFKPNEFASQI^ 
EEGKYLILKDPNKQVIRVYSLPDGTFSS (SEQ ID NO: 638), as well as N-terminal and C- 
terminal deletions of this polypeptide fragment. 
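
As a rough illustration of what is meant by N-terminal and C-terminal deletions of a
polypeptide fragment, the following Python sketch enumerates truncated variants of a
sequence. The function name `terminal_deletions`, the `min_length` parameter, and the
short example peptide are hypothetical and are not taken from the sequence listing;
the fragments actually contemplated are defined by the SEQ ID NOs recited in the text.

```python
def terminal_deletions(seq, min_length=6):
    """Return (n_deletions, c_deletions) for the peptide string `seq`.

    n_deletions: fragments with residues removed from the N-terminus.
    c_deletions: fragments with residues removed from the C-terminus.
    Fragments shorter than `min_length` (an illustrative cutoff) are skipped.
    """
    # Drop 1, 2, ... residues from the start while the fragment stays long enough.
    n_deletions = [seq[i:] for i in range(1, len(seq) - min_length + 1)]
    # Drop 1, 2, ... residues from the end while the fragment stays long enough.
    c_deletions = [seq[:j] for j in range(len(seq) - 1, min_length - 1, -1)]
    return n_deletions, c_deletions


# Hypothetical 8-residue peptide used only for illustration.
n_del, c_del = terminal_deletions("MAKFKSTV", min_length=6)
print(n_del)
print(c_del)
```

The cutoff is kept as a parameter because the text does not fix a minimum fragment
length at this point.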

This gene is expressed primarily in T cell, bone marrow, embryo and endothelial cells
and to a lesser extent in testis tumor and endometrial tumor.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: immune diseases and tumors.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system and reproductive system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues and cell types (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue or bodily fluid from an individual not having the disorder.
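
The comparison described in the preceding paragraph, in which expression in a test
sample is measured relative to a standard level from healthy tissue or bodily fluid,
might be sketched as follows. The function `classify_expression`, the two-fold
threshold, and the numeric values are assumptions chosen for illustration; the text
does not specify a cutoff or a unit of measurement.

```python
def classify_expression(sample_level, standard_level, fold_threshold=2.0):
    """Compare a sample's expression level against the standard level.

    Returns "higher", "lower", or "unchanged". The fold threshold is an
    illustrative assumption, not a value given in the text.
    """
    if sample_level < 0 or standard_level <= 0 or fold_threshold <= 1:
        raise ValueError(
            "sample must be non-negative, standard positive, threshold above 1"
        )
    ratio = sample_level / standard_level
    if ratio >= fold_threshold:
        return "higher"
    if ratio <= 1.0 / fold_threshold:
        return "lower"
    return "unchanged"


# Hypothetical, unitless expression measurements.
print(classify_expression(8.4, 2.1))
print(classify_expression(0.5, 2.1))
print(classify_expression(2.3, 2.1))
```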

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for immune disorders and cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 151

This gene is expressed primarily in testis and to a lesser extent in T cell, spinal 
cord, placenta, neutrophil and monocyte. 

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: male reproductive and endocrine
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the reproductive, immune and endocrine systems, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
and cell types (e.g., testis and other reproductive tissue, blood cells, tissue of the
nervous system, and placenta, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for regulating immune and reproductive functions.

FEATURES OF PROTEIN ENCODED BY GENE NO: 152 

The translation product of this gene shares sequence homology with tyrosyl-tRNA
synthetase, which is thought to be important in cell growth.

This gene is expressed primarily in brain, liver, keratinocytes, tonsils, and heart.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions which include, but are not limited
to, cancer and autoimmune diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the brain, liver, keratinocytes, tonsils, and heart,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissues of the nervous system,
liver, keratinocytes, tonsils and heart, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from
an individual not having the disorder.

The tissue distribution and homology to tyrosyl-tRNA synthetase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for modulating
cell growth.

FEATURES OF PROTEIN ENCODED BY GENE NO: 153 

This gene is homologous to the Drosophila transcriptional regulator dre4. (See
Accession No. 2511745.) Dre4 is a gene required for steroidogenesis in Drosophila
melanogaster and encodes a developmentally expressed homologue of the yeast
transcriptional regulator CDC68. Preferred polypeptide fragments comprise the amino
acid sequence: KKRHTDVQFYTEVGEnTDUJKHQHMHDRDDLYAEQMEREM^^ 
FIEKVEALTKEELEFEWFRDU}FNGAPYRSTCLLQPTSSALVNATEW 
VQFHLKNFDMVIVYKDYSKKVTMINAIPVASIJDPI^
DPEGFFEQGGWSFL (SEQ ID NO: 639), as well as N-terminal and C-terminal deletions of
this fragment. Also preferred are polynucleotide fragments encoding this polypeptide
fragment.

This gene is expressed primarily in fetal liver, spleen, placenta, lung, T cell,
thyroid, and testes.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: brain tumor, heart and liver
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the fetal liver, spleen, placenta, lung, T cell, thyroid, and testes,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., liver, spleen, placenta, lung, blood cells,
thyroid, and testes and other reproductive tissue, and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.



FEATURES OF PROTEIN ENCODED BY GENE NO: 154 

This gene is expressed primarily in brain and to a lesser extent in fetal heart,
testis, spleen, and lung.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: heart, liver and spleen diseases,
immunological diseases. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the brain, fetal heart, testis, spleen, and lung,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissue of the nervous system,
heart, testes and other reproductive tissue, spleen, and lung, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue or bodily fluid from an individual not having the disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 155 

Activation of T cells through the T cell antigen receptor (TCR) results in the rapid
tyrosine phosphorylation of a number of cellular proteins, one of the earliest being a
100 kDa protein. This gene is the human equivalent of murine valosin containing protein
(VCP). VCP is a member of a family of ATP binding, homo-oligomeric proteins, and the
mammalian homolog of Saccharomyces cerevisiae cdc48p, a protein essential to the
completion of mitosis in yeast. Both endogenous and expressed murine VCP are tyrosine
phosphorylated in response to T cell activation. Thus we have identified a novel
component of the TCR mediated tyrosine kinase activation pathway that may provide a
link between TCR activation and cell cycle control.

This gene is expressed primarily in brain, liver, spleen, and placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions which include, but are not limited
to, cancer and immunological disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the brain, liver, spleen, and placenta, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., brain and other tissue of the nervous system, liver, spleen,
and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level,
i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

The tissue distribution and homology to VCP indicate that polynucleotides and
polypeptides corresponding to this gene are useful for treating cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 156 

The translation product of this gene shares sequence homology with rat growth
response protein, which is thought to be important in cell growth. A group recently
cloned the human homolog of this gene, calling it insulin induced protein 1. (See
Accession No. 2358269; see also Genomics 43 (3), 278-284 (1997).) Preferred
polypeptide fragments comprise the amino acid sequence: RSGLGLXJrnAFLATLITQF
LVYNGVYQYTSPDFLYIRSWLPCIFFSGG VIVGhnGRQIJVM (SEQ ID NO: 640),
as well as N-terminal and C-terminal deletions of this polypeptide fragment. Also
preferred are polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in brain, liver, placenta, heart, spleen, and
lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions which include, but are not limited
to, cancer and immunological disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the brain, liver, placenta, heart, and spleen,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., brain and other tissue of the nervous system,
liver, placenta, heart, spleen, and lymphoid tissue, and cancerous and wounded tissues)
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to growth-response protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for modulating
cell growth.

FEATURES OF PROTEIN ENCODED BY GENE NO: 157

This gene is expressed primarily in glioblastoma, endometrial tumor,
lymphoma and pancreas tumor.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: glioblastoma, endometrial tumor,
lymphoma and pancreas tumor. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., endometrium, lymphoid tissue, pancreas, and tissue of the nervous system,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.



FEATURES OF PROTEIN ENCODED BY GENE NO: 158 

The translation product of this gene shares sequence homology with IgE
receptor, which is thought to be important in allergy and asthma.

This gene is expressed primarily in T cell and fetal liver.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: allergy and asthma and other
immunological disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., blood cells, and liver, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to IgE receptor indicate that
polynucleotides and polypeptides corresponding to this gene are useful for allergy and
asthma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 159

The translation product of this gene shares sequence homology with
immunoglobulin heavy chain, which is thought to be important in immune response to
the antigen.

This gene is expressed primarily in activated neutrophil and to a lesser extent in
activated T cell, monocyte and heart.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: infection, inflammation and
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., blood
cells, and heart, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level,
i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

The tissue distribution and homology to immunoglobulin heavy chain variable
region indicate that polynucleotides and polypeptides corresponding to this gene are
useful for making ligands that block specific antigens which cause certain diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 160 

The translation product of this gene shares sequence homology with mouse X
inactive specific transcript protein, which is thought to be important in X chromosome
inactivation.

This gene is expressed primarily in HSA172 cell and to a lesser extent in normal
ovary tissue, ovarian cancer, frontal cortex and brain.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: ovarian tumor, schizophrenia and
other neurological disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune and neural system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues and
cell types (e.g., ovary and other reproductive tissue, and brain and other tissue of the
nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level,
i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

The tissue distribution and homology to X inactive specific transcript protein
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and treatment of reproductive system tumors and CNS tumors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 161 

This gene is expressed primarily in adipose cell and to a lesser extent in liver 
and prostate. 

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: obesity and liver disorder.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
adipose cell, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., adipose cells, liver, and
prostate, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of obesity and liver disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 162 

The translation product of this gene shares sequence homology with yeast
ubiquitin activating enzyme homolog, which is thought to be important in protein
post-translational processing.

This gene is expressed primarily in stromal cell and to a lesser extent in retina,
H. Atrophic Endometrium, colon carcinoma and myeloid progenitor cell.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: defects of stromal cell
development, neuronal growth disorders and tumors. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune and neural system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., retinal cells, endometrium, colon, and bone marrow, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to ubiquitin-activating enzyme homolog
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis or treatment of some types of tumors, fucosidosis and neuronal growth
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 163 
This gene is expressed primarily in primary breast cancer and 
hemangiopericytoma and to a lesser extent in adult brain and cerebellum. 

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: breast cancer, leukemia and
cerebellum disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system and neural system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., mammary tissue, brain and other tissue of the nervous
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis or treatment of various tumors and
diseases involving the neural system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 164 

The translation product of this gene shares sequence homology with proline rich
proteins. Recently, another group has also cloned this gene, calling it CD84 leukocyte
antigen, a new member of the Ig superfamily. (See Accession No. U82988; see also
Blood 90 (6), 2398-2405 (1997).)

This gene is expressed primarily in Weizmann olfactory tissue and 
osteoclastoma and to a lesser extent in anergic T-cell. 

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: osteoporosis and immune disease.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., olfactory tissue, bone, and
blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to the Ig superfamily indicate that the 
protein product of this clone is useful for treatment of osteoporosis, autoimmune 
disease, and other immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 165 

This gene is expressed primarily in atrophic endometrium and colon cancer and 
to a lesser extent in some fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: tumors. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the immune system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., endometrium, colon, and fetal tissue, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors, specifically 
endometrium and colon tumors. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 166 

This gene is expressed primarily in human primary breast cancer and to a lesser extent 
in activated monocyte. Although the predicted signal sequence is identified in Table 1, 
other upstream sequences are also relevant. Preferred polypeptide fragments comprise
the amino acid sequence: VTQPKHLSASMGGSVEIPFSFYYPWELAXXPXVRISWRRGHFHG
QSFYSTRPPSIHKDYVNRIJFIJ^WTEGQESGFLRISh^ (SEQ ID NO: 641), as well as N-terminal
and C-terminal deletions. Also preferred are polynucleotide fragments encoding these
polypeptide fragments.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: breast cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s).
For a number of disorders of the above tissues or cells, particularly of the immune
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., mammary tissue, and blood cells, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 167 

This gene is expressed primarily in fetal tissues and to a lesser extent in adult
lung. This gene has also been mapped to chromosomal location 9q34, and thus, can be
used as a marker for linkage analysis for chromosome 9.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the embryo tissues, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., fetal
tissues, and lung, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level,
i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 168 

The translation product of this gene shares sequence homology with Ig Heavy
Chain, which is thought to be important in immune response.

This gene is expressed primarily in prostate cancer tissue specifically.

Therefore, polynucleotides and polypeptides of the invention are useful as reagents
for differential identification of the tissue(s) or cell type(s) present in a biological
sample and for diagnosis of diseases and conditions: prostate cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s).
For a number of disorders of the above tissues or cells, particularly of the prostate,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., prostate, tissue and cells of the immune
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 169 

The translation product of this gene shares sequence homology with cytosolic
acyl coenzyme-A hydrolase, which is thought to be important in neuron-specific fatty
acid metabolism. The gene represented by this contig has since been published by Hajra
and colleagues (GenBank Accession No. U91316).

This gene is expressed primarily in human pituitary gland and to a lesser extent
in colorectal cancer tissue. This gene has also been observed in the LNCAP cell line.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: hyperlipidemias of
familial and/or idiopathic origins. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly blood, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues and cell types (e.g., pituitary
and colon, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to rat cytosolic acyl coenzyme-A 
hydrolase indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for the detection or treatment of hyperlipidemia disease states by virtue of the 
ability of specific drugs to activate the enzyme. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 170 

The translation product of this gene shares sequence homology with a 
Caenorhabditis elegans gene which is thought to be important in organism 
development. 

This gene is expressed primarily in human synovial sarcoma tissue, bone
marrow, and to a lesser extent in human brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, diseases of bone, specifically synovial sarcoma. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the bone, connective tissues and possibly
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., synovial tissue, bone marrow,
brain and other tissue of the nervous system, and cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to the Caenorhabditis elegans gene indicate that
polynucleotides and polypeptides corresponding to this gene are useful as a diagnostic
and/or therapeutic modality directed at the detection and/or treatment of connective
tissue sarcomas or other related bone diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 171 

The translation product of this gene shares sequence homology with
beta1-6GlcNAc transferase which is thought to be important in the transfer and metabolism of
beta 1-6, N-acetylglucosamine. This gene product has previously been shown to
suppress melanoma lung metastasis in both syngeneic and nude mice, to decrease
invasiveness into Matrigel, and to inhibit cell attachment to collagen and laminin
without affecting cell growth.

This gene is expressed primarily in human testes and prostate tissues, and to a 
lesser extent in kidney, medulla, and pancreas. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, particularly melanoma. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., testes and other reproductive tissue, prostate, kidney, pancreas, brain and
other tissue of the nervous system, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to beta1-6GlcNAc transferase indicate that
the protein product of this clone is useful for the development of diagnostic and/or
therapeutic modalities directed at the detection and/or treatment of cancer and the
metastasis of malignant tissue or cells. Defects in this potentially secreted enzyme may
play a role in metastasis.



FEATURES OF PROTEIN ENCODED BY GENE NO: 172

This gene is expressed primarily in fetal spleen and liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: immune disorders,
Wilms' tumor disease, hepatic disorders, and hematopoietic disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the hematopoietic and
immune systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., spleen and liver, and cancerous
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or
spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and identification of fetal defects
along with correcting diseases that affect hematopoiesis and the immune system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 173 

The translation product of this gene shares sequence homology with ret II
oncogene which is thought to be important in Hirschsprung disease and many types of
cancers.

This gene is expressed in multiple tissues including the lymphatic system, brain,
and thyroid.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for identification of the tissue(s) or cell type(s) present in a biological sample
and for diagnosis of diseases and conditions: Hirschsprung disease and multiple
cancers. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune and
central nervous system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., lymphoid tissue,
thyroid, and brain and other tissue of the nervous system, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to ret II oncogene indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of various cancers. It would also be useful for the diagnosis and treatment
of Hirschsprung disease. Preferred polypeptides of the invention comprise the amino
acid sequence: MEA(XiVNEAESAREQLQXUiIX}IAGQKASKQEl£TE^ 
RTKmTXJSRnmRDEHQKUWQLTNKTLSNSSQSEm 
VFQLERI^QQMNSASGSSSNGSSD^SGroNGEGTRUUWPVU^ 
SnXJFSIRIXtlFLRRYPIARVFVIIYMAIXHLWV^^ HHDQPYGK (SEQ ID NO: 642).

FEATURES OF PROTEIN ENCODED BY GENE NO: 174 

The translation product of this gene shares sequence homology with testis
enhanced gene transcript which is thought to be important in regulation of human
development.

This gene is expressed primarily in infant brain and to a lesser extent in a variety 
of other tissues and cell types, including the prostate, testes, monocytes, macrophages, 
dendritic cells, keratinocytes, and adipocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neurological,
developmental, immune and inflammation disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the brain and immune systems, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues and cell types (e.g., brain and other tissue of the nervous system, prostate,
testes and other reproductive tissue, blood cells, keratinocytes, and adipocytes, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to testis enhanced gene transcript indicate
that the protein product of this clone is useful for diagnosis and treatment of disorders
involving the developing brain and the immune system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 175

This gene is expressed primarily in prostate and to a lesser extent in various 
other tissues, including placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancers, especially of the prostate. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the prostate, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., prostate and placenta, and cancerous and wounded tissues) or bodily fluids
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that the protein product of this clone is useful
for diagnosis and treatment of prostate disorders and cancer. It may also be useful for
the diagnosis and treatment of endocrine disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 176 

The translation product of this gene shares sequence homology with
Saccharomyces cerevisiae YNT20 gene which is thought to be important in
mitochondrial function.

This gene is expressed at a particularly high level in muscle tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases related to such tissues and cell types
including: muscle wasting diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the neuromuscular system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., muscle and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to the YNT20 gene indicate that this
protein is useful for treatment and detection of neuromuscular diseases caused by loss
of mitochondrial function. For example, this gene or its protein product could be used in
replacement therapy for such diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 177 

This gene is expressed primarily in the brain and to a lesser extent in kidney,
placenta, smooth muscle, heart and lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: neuromuscular
diseases, degenerative diseases of the central nervous system, and heart disease.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
neuromuscular system, central nervous system, and heart, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues and cell
types (e.g., brain and other tissue of the nervous system, kidney, placenta, muscle,
heart and lung, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

This gene or its protein product could also be used for replacement therapy for 
the above mentioned diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 178

The translation product of this gene shares sequence homology with caldesmon
which is thought to be important in the cellular response to changes in glucose levels.

This gene is expressed primarily in multiple tissues including brain and retina.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for identification of the tissue(s) or cell type(s) present in a biological sample
and for diagnosis of diseases and conditions: central nervous system disorders and
retinopathy. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly CNS
disorders and retinopathy, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of
the nervous system, and retinal tissue, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to caldesmon indicate that polynucleotides
and polypeptides corresponding to this gene are useful for treatment of retinopathies.

FEATURES OF PROTEIN ENCODED BY GENE NO: 179 

The translation product of this gene shares sequence homology with mouse
fibrosin protein which is thought to be important in regulation of fibrinogenesis in
certain chronic inflammatory diseases.

This gene is expressed primarily in amniotic cells and breast tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of breast cancer and abnormal embryo
development. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the reproductive system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., amniotic cells, and
mammary tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to fibrosin indicate that the protein product
of this clone is useful for treatment of breast cancer. This gene or its protein product
could be used in replacement therapy for breast cancer. In addition, the protein product
of this gene is useful in the treatment of chronic inflammatory diseases.



FEATURES OF PROTEIN ENCODED BY GENE NO: 180 

This gene is expressed in several infant tissues including brain and liver and
various adult tissues including brain, lung, liver, testes, and prostate.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, brain cancer, lung cancer, liver cancer and cancers of the reproductive
system. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
central nervous system, hepatic system, and reproductive system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
and cell types (e.g., brain and other tissue of the nervous system, lung, liver, testes and
other reproductive tissue, and prostate, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution of this gene product indicates that the protein product of 
this clone is involved in growth regulation and could be used as a growth factor or 
growth blocker in a variety of settings including treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 181

This gene is expressed primarily in activated monocytes and to a lesser extent in 
melanocytes and dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of immune system diseases and cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues and cell types (e.g., blood cells, melanocytes, and dendritic cells, and
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein product of this clone could be
involved in growth regulation and could be used as a growth factor or growth blocker
in a variety of settings including treatment of cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 182

This gene is expressed primarily in placenta and several tumors of various tissue
origin and to a lesser extent in normal tissues including liver, lung, brain, and skin.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of cancers of all kinds. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the central nervous system, respiratory
system and skin, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues and cell types (e.g., liver, lung, brain and other
tissues of the nervous system, and skin, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The high expression of this gene in multiple tumors indicates that the protein
product of the clone may be involved in cell growth control and therefore would be
useful for treatment of certain cancers. Likewise, molecules developed to block the
activity of the protein product of this clone could be used to block its potential role in
tumor growth promotion.

FEATURES OF PROTEIN ENCODED BY GENE NO: 183 

The translation product of this gene shares sequence homology with the mouse
Ndr1 gene which is thought to be important in cancer progression.

This gene is expressed in multiple cell types and tissues including brain, lung,
kidney, bone marrow, liver, and spleen.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of all types of cancers. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the nervous, immune, and endocrine
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous
system, lung, kidney, bone marrow, liver and spleen, and cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to the Ndr1 gene, which is thought to be
involved in cancer progression, indicate that polynucleotides and polypeptides
corresponding to this gene are useful for treatment of certain cancers. Likewise,
molecules developed to block the activity of the protein product of this clone could be
used to block its potential role in tumor growth promotion.

FEATURES OF PROTEIN ENCODED BY GENE NO: 184 

This gene is expressed primarily in early stage human brain and liver and to a 
lesser extent in several other fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions: brain and liver cancers.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
central nervous system and immune system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues and cell types (e.g.,
brain and other tissue of the nervous system, liver, and fetal tissue, and cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The expression of this gene in embryonic tissues indicates that the protein could 
be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 185

This gene is expressed primarily in infant and embryonic brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of degenerative nervous system disorders and brain
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
central nervous system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues and cell types (e.g., embryonic tissue, brain
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The expression of this gene in embryonic tissues indicates that the protein could
be involved in growth regulation and could be used as a growth factor or growth
blocker in a variety of settings including treatment of cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 186 

This gene is expressed primarily in multiple tissues including placenta, fetal
lung, fetal liver, and brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of all types of cancers including liver, brain and
lung. Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
central nervous system, pulmonary system, and hepatic system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues and
cell types (e.g., placenta, lung, liver, and brain and other tissue of the nervous system,
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The expression of this gene in embryonic tissues indicates that the protein could
be involved in growth regulation and could be used as a growth factor or growth
blocker in a variety of settings including treatment of cancers.

Table 1 summarizes the information corresponding to each "Gene No."
described above. The nucleotide sequence identified as "NT SEQ ID NO:X" was
assembled from partially homologous ("overlapping") sequences obtained from the
"cDNA clone ID" identified in Table 1 and, in some cases, from additional related DNA
clones. The overlapping sequences were assembled into a single contiguous sequence
of high redundancy (usually three to five overlapping sequences at each nucleotide
position), resulting in a final sequence identified as SEQ ID NO:X.
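
As an illustration only, the idea of deriving one contiguous sequence from several overlapping sequences can be sketched as a per-position majority vote. This is a minimal sketch and not the assembly method actually used for SEQ ID NO:X: it assumes the reads have already been aligned and padded with "-" where a read does not cover a position, and the function name, the padding convention, and the example reads are all hypothetical.

```python
from collections import Counter


def consensus(aligned_reads, min_depth=1):
    """Majority-vote consensus over pre-aligned reads.

    Assumption for this sketch: '-' marks a position a read does not
    cover. Positions covered by fewer than min_depth reads become 'N'.
    """
    length = max(len(r) for r in aligned_reads)
    result = []
    for i in range(length):
        # Collect the bases that actually cover position i.
        column = [r[i] for r in aligned_reads if i < len(r) and r[i] != "-"]
        if len(column) < min_depth:
            result.append("N")  # too few reads to call this position
            continue
        base, _count = Counter(column).most_common(1)[0]
        result.append(base)
    return "".join(result)


# Four made-up overlapping reads; the second disagrees at one position.
reads = [
    "ATGGCC----",
    "-TGGCTAAG-",
    "--GGCCAAGT",
    "ATGGCCAA--",
]
contig = consensus(reads)
```

With three to five reads covering each position, a single misread base is outvoted by the others, which is the practical benefit of the redundancy described above.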

The cDNA Clone ID was deposited on the date and given the corresponding
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain
multiple different clones corresponding to the same gene. "Vector" refers to the type of
vector contained in the cDNA Clone ID.

"Total NT Seq." refers to the total number of nucleotides in the contig identified
by "Gene No." The deposited clone may contain all or most of these sequences,
reflected by the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT
of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the
putative start codon (methionine) is identified as "5' NT of Start Codon." Similarly,
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as
"5' NT of First AA of Signal Pep."

The translated amino acid sequence, beginning with the methionine, is identified
as "AA SEQ ID NO:Y," although other reading frames can also be easily translated
using known molecular biology techniques. The polypeptides produced by these
alternative open reading frames are specifically contemplated by the present invention.

The first and last amino acid positions of SEQ ID NO:Y of the predicted signal
peptide are identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID
NO:Y of the last amino acid in the open reading frame is identified as "Last AA of
ORF." 
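
By way of illustration only, the amino acid columns of Table 1 can be read as 1-based, inclusive positions along SEQ ID NO:Y. The following minimal sketch shows how those positions would split a translated sequence into its predicted signal peptide and secreted portion. The class name, field names, and the 12-residue sequence are hypothetical and are not taken from Table 1.

```python
from dataclasses import dataclass


@dataclass
class Table1Row:
    """Hypothetical container for the amino acid columns of Table 1 (1-based, inclusive)."""
    first_aa_sig_pep: int
    last_aa_sig_pep: int
    first_aa_secreted: int
    last_aa_orf: int

    def signal_peptide(self, protein: str) -> str:
        # Convert 1-based inclusive positions to a Python slice.
        return protein[self.first_aa_sig_pep - 1:self.last_aa_sig_pep]

    def secreted_portion(self, protein: str) -> str:
        return protein[self.first_aa_secreted - 1:self.last_aa_orf]


# Made-up 12-residue sequence, for illustration only.
protein = "MKLLVAAGSTEQ"
row = Table1Row(first_aa_sig_pep=1, last_aa_sig_pep=7,
                first_aa_secreted=8, last_aa_orf=12)
signal = row.signal_peptide(protein)
secreted = row.secreted_portion(protein)
```

In this sketch the secreted portion begins at the residue immediately following the last residue of the signal peptide, which is the usual case but is an assumption here.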

SEQ ID NO:X and the translated SEQ ID NO:Y are sufficiently accurate and
otherwise suitable for a variety of uses well known in the art and described further
below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the cDNA
contained in the deposited clone. These probes will also hybridize to nucleic acid
molecules in biological samples, thereby enabling a variety of forensic and diagnostic
methods of the invention. Similarly, polypeptides identified from SEQ ID NO:Y may
be used to generate antibodies which bind specifically to the secreted proteins encoded
by the cDNA clones identified in Table 1.






Nevertheless, DNA sequences generated by sequencing reactions can contain
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid
sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion 
in an open reading frame of over 1000 bases). 
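
The effect of a single inserted base can be illustrated with a short sketch. It assumes the standard genetic code and uses a made-up open reading frame of 1011 bases; the sequence, the insertion position, and the function name are hypothetical. The generated sequence remains more than 99.9% identical to the actual sequence, yet the predicted amino acid sequence diverges from the point of insertion.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Made-up "actual" ORF: ATG followed by 84 repeats of a 12-base unit (1011 bases).
true_orf = "ATG" + "GCTGAAAAACTG" * 84
# "Generated" sequence with one erroneously inserted base after position 300.
read = true_orf[:300] + "G" + true_orf[300:]

true_protein = translate(true_orf)
read_protein = translate(read)

# 1011 matching bases over 1012 aligned columns.
dna_identity = len(true_orf) / len(read)
```

The first 100 predicted residues agree, after which the two translations differ; in this particular example the shifted frame also introduces a premature stop codon.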

Accordingly, for those applications requiring precision in the nucleotide
sequence or the amino acid sequence, the present invention provides not only the
generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated
amino acid sequence identified as SEQ ID NO:Y, but also a sample of plasmid DNA
containing a human cDNA of the invention deposited with the ATCC, as set forth in
Table 1. The nucleotide sequence of each deposited clone can readily be determined by
sequencing the deposited clone in accordance with known methods. The predicted
amino acid sequence can then be verified from such deposits. Moreover, the amino
acid sequence of the protein encoded by a particular clone can also be directly
determined by peptide sequencing or by expressing the protein in a suitable host cell
containing the deposited human cDNA, collecting the protein, and determining its
sequence.

The present invention also relates to the genes corresponding to SEQ ID NO:X,
SEQ ID NO:Y, or the deposited clone. The corresponding gene can be isolated in
accordance with known methods using the sequence information disclosed herein.
Such methods include preparing probes or primers from the disclosed sequence and
identifying or amplifying the corresponding gene from appropriate sources of genomic
material.

Also provided in the present invention are species homologs. Species
homologs may be isolated and identified by making suitable probes or primers from the
sequences provided herein and screening a suitable nucleic acid source for the desired
homolog.

The polypeptides of the invention can be prepared in any suitable manner. Such
polypeptides include isolated naturally occurring polypeptides, recombinantly produced
polypeptides, synthetically produced polypeptides, or polypeptides produced by a
combination of these methods. Means for preparing such polypeptides are well
understood in the art.

The polypeptides may be in the form of the secreted protein, including the
mature form, or may be a part of a larger protein, such as a fusion protein (see below).
It is often advantageous to include an additional amino acid sequence which contains
secretory or leader sequences, pro-sequences, sequences which aid in purification,
such as multiple histidine residues, or an additional sequence for stability during
recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated
form, and preferably are substantially purified. A recombinantly produced version of a
polypeptide, including the secreted polypeptide, can be substantially purified by the
one-step method described in Smith and Johnson, Gene 67:31-40 (1988).
Polypeptides of the invention also can be purified from natural or recombinant sources 
using antibodies of the invention raised against the secreted protein in methods which 
are well known in the art. 

Signal Sequences

Methods for predicting whether a protein has a signal sequence, as well as the
cleavage point for that sequence, are available. For instance, the method of McGeoch,
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged
region and a subsequent uncharged region of the complete (uncleaved) protein. The
method of von Heijne, Nucleic Acids Res. 14:4683-4690 (1986) uses the information
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1
indicates the amino terminus of the secreted protein. The accuracy of predicting the
cleavage points of known mammalian secretory proteins for each of these methods is in
the range of 75-80%. (von Heijne, supra.) However, the two methods do not always
produce the same predicted cleavage point(s) for a given protein.

In the present case, the deduced amino acid sequence of the secreted polypeptide
was analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on
the amino acid sequence. As part of this computational prediction of localization, the
methods of McGeoch and von Heijne are incorporated. The analysis of the amino acid
sequences of the secreted proteins described herein by this program provided the results
shown in Table 1.

As one of ordinary skill would appreciate, however, cleavage sites sometimes
vary from organism to organism and cannot be predicted with absolute certainty.
Accordingly, the present invention provides secreted polypeptides having a sequence
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., +
or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in
some cases, cleavage of the signal sequence from a secreted protein is not entirely
uniform, resulting in more than one secreted species. These polypeptides, and the
polynucleotides encoding such polypeptides, are contemplated by the present invention.
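
As a minimal sketch of the + or - 5 residue window described above, the following enumerates the secreted species whose N-terminus begins within 5 residues of a predicted first residue. The function name, the 1-based position convention, and the 20-residue sequence are assumptions made for illustration only.

```python
def candidate_n_termini(protein: str, predicted_first_aa: int, window: int = 5) -> dict:
    """Map each 1-based start position within +/- window of the prediction
    to the polypeptide that begins at that position."""
    candidates = {}
    for start in range(predicted_first_aa - window, predicted_first_aa + window + 1):
        if 1 <= start <= len(protein):  # ignore positions outside the sequence
            candidates[start] = protein[start - 1:]
    return candidates


# Made-up 20-residue sequence; residue 8 is the hypothetical predicted N-terminus.
protein = "MKLLVAAGSTEQLRPDKWYF"
cands = candidate_n_termini(protein, predicted_first_aa=8)
```

For a prediction far enough from either end of the sequence, the window yields 11 candidate species (the predicted N-terminus and five positions on each side).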

Moreover, the signal sequence identified by the above analysis may not
necessarily predict the naturally occurring signal sequence. For example, the naturally
occurring signal sequence may be further upstream from the predicted signal sequence.
However, it is likely that the predicted signal sequence will be capable of directing the
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such
polypeptides, are contemplated by the present invention.

Polynucleotide and Polypeptide Variants

"Variant" refers to a polynucleotide or polypeptide differing from the
polynucleotide or polypeptide of the present invention, but retaining essential properties
thereof. Generally, variants are overall closely similar, and, in many regions, identical
to the polynucleotide or polypeptide of the present invention.

"Identity" per se has an art-recognized meaning and can be calculated using
published techniques. (See, e.g.: COMPUTATIONAL MOLECULAR BIOLOGY,
Lesk, A.M., ed., Oxford University Press, New York, (1988); BIOCOMPUTING:
INFORMATICS AND GENOME PROJECTS, Smith, D.W., ed., Academic Press,
New York, (1993); COMPUTER ANALYSIS OF SEQUENCE DATA, PART I,
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, (1994);
SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, von Heijne, G., Academic
Press, (1987); and SEQUENCE ANALYSIS PRIMER, Gribskov, M. and Devereux,
J., eds., M Stockton Press, New York, (1991).) While there exist a number of
methods to measure identity between two polynucleotide or polypeptide sequences, the
term "identity" is well known to skilled artisans. (Carillo, H., and Lipton, D., SIAM J
Applied Math 48:1073 (1988).) Methods commonly employed to determine identity or
similarity between two sequences include, but are not limited to, those disclosed in
"Guide to Huge Computers," Martin J. Bishop, ed., Academic Press, San Diego,
(1994), and Carillo, H., and Lipton, D., SIAM J Applied Math 48:1073 (1988).
Methods for aligning polynucleotides or polypeptides are codified in computer
programs, including the GCG program package (Devereux, J., et al., Nucleic Acids
Research 12(1):387 (1984)), BLASTP, BLASTN, FASTA (Altschul, S.F. et
al., J. Molec. Biol. 215:403 (1990)), and the Bestfit program (Wisconsin Sequence
Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research
Park, 575 Science Drive, Madison, WI 53711), which uses the local homology algorithm
of Smith and Waterman, Advances in Applied Mathematics 2:482-489 (1981).

When using any of the sequence alignment programs to determine whether a
particular sequence is, for instance, 95% identical to a reference sequence, the 
parameters are set so that the percentage of identity is calculated over the full length of 
the reference polynucleotide and that gaps in identity of up to 5% of the total number of 
nucleotides in the reference polynucleotide are allowed. 

A preferred method for determining the best overall match between a query
sequence (a sequence of the present invention) and a subject sequence, also referred to
as a global sequence alignment, uses the FASTDB computer
program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245
(1990).) The term "sequence" includes nucleotide and amino acid sequences. In a
sequence alignment the query and subject sequences are either both nucleotide
sequences or both amino acid sequences. The result of said global sequence alignment
is in percent identity. Preferred parameters used in a FASTDB search of a DNA
sequence to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch
Penalty=1, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=1,
Gap Penalty=5, Gap Size Penalty=0.05, and Window Size=500 or query sequence
length in nucleotide bases, whichever is shorter. Preferred parameters employed to
calculate percent identity and similarity of an amino acid alignment are: Matrix=PAM
150, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group
Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty=0.05, and Window
Size=500 or query sequence length in amino acid residues, whichever is shorter.
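
For convenience, the preferred parameter sets recited above can be written out as plain data. The sketch below is only a transcription of those values into Python dictionaries, together with the "whichever is shorter" rule for the window size; the dictionary names and the function name are hypothetical, and nothing here calls or reproduces the FASTDB program itself.

```python
# Preferred parameters for a DNA comparison, as recited in the text.
FASTDB_DNA_PARAMS = {
    "Matrix": "Unitary",
    "k-tuple": 4,
    "Mismatch Penalty": 1,
    "Joining Penalty": 30,
    "Randomization Group Length": 0,
    "Cutoff Score": 1,
    "Gap Penalty": 5,
    "Gap Size Penalty": 0.05,
}

# Preferred parameters for an amino acid comparison; only three entries differ.
FASTDB_PROTEIN_PARAMS = dict(FASTDB_DNA_PARAMS)
FASTDB_PROTEIN_PARAMS.update({"Matrix": "PAM 150", "k-tuple": 2, "Joining Penalty": 20})


def window_size(query_length: int) -> int:
    """Window Size is 500 or the query sequence length, whichever is shorter."""
    return min(500, query_length)
```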

As an illustration, a polynucleotide having a nucleotide sequence of at least 95%
"identity" to a sequence contained in SEQ ID NO:X or the cDNA contained in the
deposited clone, means that the polynucleotide is identical to a sequence contained in
SEQ ID NO:X or the cDNA except that the polynucleotide sequence may include up to
five point mutations per each 100 nucleotides of the total length (not just within a given
100 nucleotide stretch). In other words, to obtain a polynucleotide having a nucleotide
sequence at least 95% identical to SEQ ID NO:X or the deposited clone, up to 5% of the
nucleotides in the sequence contained in SEQ ID NO:X or the cDNA can be deleted,
inserted, or substituted with other nucleotides. These changes may occur anywhere
throughout the polynucleotide.
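
The arithmetic of this definition can be sketched as follows. The code assumes that the two sequences have already been aligned by some program, with "-" marking a gap, and counts every altered column (substitution, deletion, or insertion) against the full length of the reference. The function names are hypothetical, and the calculation is a simplification that does not reproduce the scoring of any particular alignment program.

```python
def percent_identity(aligned_query: str, aligned_reference: str) -> float:
    """Percent identity over the full (ungapped) length of the reference."""
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    reference_length = sum(1 for r in aligned_reference if r != "-")
    # A column is altered if the residues differ, including gap-versus-residue columns.
    alterations = sum(1 for q, r in zip(aligned_query, aligned_reference) if q != r)
    return 100.0 * (reference_length - alterations) / reference_length


def max_alterations(reference_length: int, min_identity_pct: int) -> int:
    """Largest number of alterations compatible with the stated percent identity."""
    return reference_length * (100 - min_identity_pct) // 100


one_mismatch = percent_identity("ACGTTCGTAC", "ACGTACGTAC")
allowed = max_alterations(1000, 95)
```

Under these assumptions, a 1000-nucleotide reference tolerates up to 50 alterations at the 95% level, distributed anywhere along its length.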

Further embodiments of the present invention include polynucleotides having at 
least 85% identity, more preferably at least 90% identity, and most preferably at least 
95%, 96%, 97%, 98% or 99% identity to a sequence contained in SEQ ID NO:X or the 
cDNA contained in the deposited clone. Of course, due to the degeneracy of the genetic 
code, one of ordinary skill in the art will immediately recognize that a large number of 
the polynucleotides having at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity 
will encode a polypeptide identical to an amino acid sequence contained in SEQ ID
NO:Y or the expressed protein produced by the deposited clone.
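
The consequence of the degeneracy of the genetic code can be shown with a short sketch, assuming the standard genetic code. The two 12-base sequences below are made up for illustration; they differ at three positions and are therefore only 75% identical, yet they encode the same four-residue peptide.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


variant_a = "ATGGCTAAAGAT"  # ATG GCT AAA GAT
variant_b = "ATGGCCAAGGAC"  # ATG GCC AAG GAC (third-position changes only)

differences = sum(1 for x, y in zip(variant_a, variant_b) if x != y)
peptide_a = translate(variant_a)
peptide_b = translate(variant_b)
```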

Similarly, by a polypeptide having an amino acid sequence having at least, for 
example, 95% "identity" to a reference polypeptide, is intended that the amino acid 
sequence of the polypeptide is identical to the reference polypeptide except that the 
polypeptide sequence may include up to five amino acid alterations per each 100 amino 
acids of the total length of the reference polypeptide. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference amino 
acid sequence, up to 5% of the amino acid residues in the reference sequence may be 
deleted or substituted with another amino acid, or a number of amino acids up to 5% of 
the total amino acid residues in the reference sequence may be inserted into the reference 
sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere between 
those terminal positions, interspersed either individually among residues in the 
reference sequence or in one or more contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at 
least 80% identity, more preferably at least 85% identity, more preferably at least 90% 
identity, and most preferably at least 95%, 96%, 97%, 98% or 99% identity to an 
amino acid sequence contained in SEQ ID NO:Y or the expressed protein produced by
the deposited clone. Preferably, the above polypeptides should exhibit at least one 
biological activity of the protein. 

In a preferred embodiment, polypeptides of the present invention include 
polypeptides having at least 90% similarity, more preferably at least 95% similarity, and 
still more preferably at least 96%, 97%, 98%, or 99% similarity to an amino acid 
sequence contained in SEQ ID NO:Y or the expressed protein produced by the
deposited clone. 

The variants may contain alterations in the coding regions, non-coding regions, 
or both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 
activities of the encoded polypeptide. Nucleotide variants produced by silent 
substitutions due to the degeneracy of the genetic code are preferred. Moreover,
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any
combination are also preferred. Polynucleotide variants can be produced for a variety
of reasons, e.g., to optimize codon expression for a particular host (change codons in
the human mRNA to those preferred by a bacterial host such as E. coli).

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These
allelic variants can vary at the polynucleotide and/or polypeptide level.
Alternatively, non-naturally occurring variants may be produced by mutagenesis
techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA
technology, variants may be generated to improve or alter the characteristics of the
polypeptides of the present invention. For instance, one or more amino acids can be
deleted from the N-terminus or C-terminus of the secreted protein without substantial
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268:2984-2988
(1993), reported variant KGF proteins having heparin binding activity even after
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).)

Moreover, ample evidence demonstrates that variants often retain a biological
activity similar to that of the naturally occurring protein. For example, Gayle and
coworkers (J. Biol. Chem. 268:22105-22111 (1993)) conducted extensive mutational
analysis of human cytokine IL-1a. They used random mutagenesis to generate over
3,500 individual IL-1a mutants that averaged 2.5 amino acid changes per variant over
the entire length of the molecule. Multiple mutations were examined at every possible
amino acid position. The investigators found that "[m]ost of the molecule could be
altered with little effect on either [binding or biological activity]." (See, Abstract.) In
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide
sequences examined, produced a protein that significantly differed in activity from
wild-type.

Furthermore, even if deleting one or more amino acids from the N-terminus or
C-terminus of a polypeptide results in modification or loss of one or more biological
functions, other biological activities may still be retained. For example, the ability of a
deletion variant to induce and/or to bind antibodies which recognize the secreted form
will likely be retained when less than the majority of the residues of the secreted form
are removed from the N-terminus or C-terminus. Whether a particular polypeptide
lacking N- or C-terminal residues of a protein retains such immunogenic activities can
readily be determined by routine methods described herein and otherwise known in the
art.



Thus, the invention further includes polypeptide variants which show
substantial biological activity. Such variants include deletions, insertions, inversions,
repeats, and substitutions selected according to general rules known in the art so as
to have little effect on activity. For example, guidance concerning how to make
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.,
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main
strategies for studying the tolerance of an amino acid sequence to change.

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions
where substitutions have been tolerated by natural selection indicate that these
positions are not critical for protein function. Thus, positions tolerating amino acid
substitution could be modified while still maintaining biological activity of the protein.

The second strategy uses genetic engineering to introduce amino acid changes at
specific positions of a cloned gene to identify regions critical for protein function. For
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of
single alanine mutations at every residue in the molecule) can be used. (Cunningham
and Wells, Science 244:1081-1085 (1989).) The resulting mutant molecules can then
be tested for biological activity.
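
As an illustration of the bookkeeping involved in alanine-scanning mutagenesis, the following sketch lists the single-alanine mutants of a protein sequence. It works on sequence strings only and says nothing about how the mutants are made or tested; the function name and the four-residue sequence are hypothetical, and positions that are already alanine are skipped as an assumption of this sketch.

```python
def alanine_scan(protein: str) -> list:
    """Return (1-based position, mutant sequence) pairs, one single-alanine
    mutant for each residue that is not already alanine."""
    return [(i + 1, protein[:i] + "A" + protein[i + 1:])
            for i, residue in enumerate(protein)
            if residue != "A"]


# Made-up sequence; position 3 is already alanine and yields no mutant.
mutants = alanine_scan("MKAT")
```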

As the authors state, these two strategies have revealed that proteins are
surprisingly tolerant of amino acid substitutions. The authors further indicate which
amino acid changes are likely to be permissive at certain amino acid positions in the
protein. For example, most buried (within the tertiary structure of the protein) amino
acid residues require nonpolar side chains, whereas few features of surface side chains
are generally conserved. Moreover, tolerated conservative amino acid substitutions
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and Ile;
replacement of the hydroxyl residues Ser and Thr; replacement of the acidic residues
Asp and Glu; replacement of the amide residues Asn and Gln; replacement of the basic
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp;
and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly.
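
The groups of conservative substitutions listed above lend themselves to a simple lookup. The sketch below encodes the seven groups with the usual one-letter amino acid codes and tests whether a proposed exchange stays within at least one group. The dictionary name, the group labels, and the function name are hypothetical, and the check is offered only as an illustration of the listed groups, not as a predictor of activity.

```python
# One-letter codes: A=Ala, V=Val, L=Leu, I=Ile, S=Ser, T=Thr, D=Asp, E=Glu,
# N=Asn, Q=Gln, K=Lys, R=Arg, H=His, F=Phe, Y=Tyr, W=Trp, M=Met, G=Gly.
CONSERVATIVE_GROUPS = {
    "aliphatic or hydrophobic": {"A", "V", "L", "I"},
    "hydroxyl": {"S", "T"},
    "acidic": {"D", "E"},
    "amide": {"N", "Q"},
    "basic": {"K", "R", "H"},
    "aromatic": {"F", "Y", "W"},
    "small-sized": {"A", "S", "T", "M", "G"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one listed group."""
    return original != replacement and any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )
```

Note that some residues (Ala, Ser, Thr) belong to two groups, so an exchange such as Ala to Gly qualifies through the small-sized group even though Gly is not in the aliphatic group.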

Besides conservative amino acid substitution, variants of the present invention
include (i) substitutions with one or more of the non-conserved amino acid residues,
where the substituted amino acid residues may or may not be one encoded by the
genetic code, or (ii) substitution with one or more of amino acid residues having a
substituent group, or (iii) fusion of the mature polypeptide with another compound,
such as a compound to increase the stability and/or solubility of the polypeptide (for
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a
sequence facilitating purification. Such variant polypeptides are deemed to be within
the scope of those skilled in the art from the teachings herein.

For example, polypeptide variants containing amino acid substitutions of
charged amino acids with other charged or neutral amino acids may produce proteins
with improved characteristics, such as less aggregation. Aggregation of pharmaceutical
formulations both reduces activity and increases clearance due to the aggregate's
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967);
Robbins et al., Diabetes 36:838-845 (1987); Cleland et al., Crit. Rev. Therapeutic
Drug Carrier Systems 10:307-377 (1993).)

Polynucleotide and Polypeptide Fragments

In the present invention, a "polynucleotide fragment" refers to a short
polynucleotide having a nucleic acid sequence contained in the deposited clone or
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt,
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in
length," for example, is intended to include 20 or more contiguous bases from the
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers
as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000
nucleotides) are preferred.

Moreover, representative examples of polynucleotide fragments of the
invention include, for example, fragments having a sequence from about nucleotide
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400,
401-450, 451-500, 501-550, 551-600, 651-700, and 701 to the end of SEQ ID NO:X or the
cDNA contained in the deposited clone. In this context "about" includes the particularly
recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either
terminus or at both termini. Preferably, these fragments encode a polypeptide which
has biological activity.
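
Ranges of this kind can be generated mechanically. The sketch below produces consecutive, 1-based, inclusive windows of a fixed size, with the final window running to the end of the sequence. The function name is hypothetical, and the output is an unbroken series of windows, which is a simplification rather than a restatement of the exact ranges recited above.

```python
def fragment_ranges(total_length: int, size: int = 50) -> list:
    """Consecutive 1-based inclusive (start, end) windows of the given size;
    the last window is truncated at the end of the sequence."""
    return [(start, min(start + size - 1, total_length))
            for start in range(1, total_length + 1, size)]


# A hypothetical 120-nucleotide sequence split into 50-nucleotide windows.
ranges = fragment_ranges(120)
```

The same function can be called with a size of 20 to tabulate windows along an amino acid sequence.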

In the present invention, a "polypeptide fragment" refers to a short amino acid
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the
deposited clone. Protein fragments may be "free-standing," or comprised within a
larger polypeptide of which the fragment forms a part or region, most preferably as a
single continuous region. Representative examples of polypeptide fragments of the
invention include, for example, fragments from about amino acid number 1-20, 21-40,
41-60, 61-80, 81-100, 102-120, 121-140, 141-160, and 161 to the end of the coding
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90,
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about"
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
amino acids, at either extreme or at both extremes.

Preferred polypeptide fragments include the secreted protein as well as the
mature form. Further preferred polypeptide fragments include the secreted protein or
the mature form having a continuous series of deleted residues from the amino or the
carboxy terminus, or both. For example, any number of amino acids, ranging from
1-60, can be deleted from the amino terminus of either the secreted polypeptide or the
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted
from the carboxy terminus of the secreted protein or mature form. Furthermore, any
combination of the above amino and carboxy terminus deletions is preferred.
Similarly, polynucleotide fragments encoding these polypeptide fragments are also
preferred.
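
A terminal deletion of this kind amounts to trimming a contiguous run of residues from one or both ends of the sequence. The following minimal sketch shows the operation on a sequence string; the function name and the eight-residue sequence are hypothetical, and the guard against deleting the whole sequence is an assumption of the sketch.

```python
def terminal_deletion(protein: str, n_deleted: int = 0, c_deleted: int = 0) -> str:
    """Remove n_deleted residues from the amino terminus and c_deleted
    residues from the carboxy terminus."""
    if n_deleted < 0 or c_deleted < 0:
        raise ValueError("deletion counts must be non-negative")
    if n_deleted + c_deleted >= len(protein):
        raise ValueError("deletions would remove the entire sequence")
    return protein[n_deleted:len(protein) - c_deleted]


# Made-up sequence: delete two amino-terminal and one carboxy-terminal residue.
trimmed = terminal_deletion("MKLLVAAG", n_deleted=2, c_deleted=1)
```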

Also preferred are polypeptide and polynucleotide fragments characterized by
structural or functional domains, such as fragments that comprise alpha-helix and
alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and
turn-forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions,
surface-forming regions, substrate binding region, and high antigenic index regions.
Polypeptide fragments of SEQ ID NO:Y falling within conserved domains are
specifically contemplated by the present invention. Moreover, polynucleotide
fragments encoding these domains are also contemplated.

Other preferred fragments are biologically active fragments. Biologically active
fragments are those exhibiting activity similar, but not necessarily identical, to an
activity of the polypeptide of the present invention. The biological activity of the
fragments may include an improved desired activity, or a decreased undesirable activity.

Epitopes & Antibodies 

In the present invention, "epitopes" refer to polypeptide fragments having
antigenic or immunogenic activity in an animal, especially in a human. A preferred
embodiment of the present invention relates to a polypeptide fragment comprising an
epitope, as well as the polynucleotide encoding this fragment. A region of a protein
molecule to which an antibody can bind is defined as an "antigenic epitope." In
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1983).)

Fragments which function as epitopes may be produced by any conventional
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135
(1985) further described in U.S. Patent No. 4,631,211.)

In the present invention, antigenic epitopes preferably contain a sequence of at
least seven, more preferably at least nine, and most preferably between about 15 and
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et
al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).)

Similarly, immunogenic epitopes can be used to induce antibodies according to
methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et al.,
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes
the secreted protein. The immunogenic epitopes may be presented together with a
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if
it is long enough (at least about 25 amino acids), without a carrier. However,
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a
denatured polypeptide (e.g., in Western blotting).

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is
meant to include intact molecules as well as antibody fragments (such as, for example,
Fab and F(ab')2 fragments) which are capable of specifically binding to protein. Fab
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from
the circulation, and may have less non-specific tissue binding than an intact antibody.
(Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these fragments are preferred,
as well as the products of a FAB or other immunoglobulin expression library.
Moreover, antibodies of the present invention include chimeric, single chain, and
humanized antibodies.

Fusion Proteins

Any polypeptide of the present invention can be used to generate fusion 
proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second protein
by binding to the polypeptide. Moreover, because secreted proteins target cellular
locations based on trafficking signals, the polypeptides of the present invention can be
used as targeting molecules once fused to other proteins.

Examples of domains that can be fused to polypeptides of the present invention
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through 
linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of
the polypeptide of the present invention. For instance, a region of additional amino
acids, particularly charged amino acids, may be added to the N-terminus of the
polypeptide to improve stability and persistence during purification from the host cell or
subsequent handling and storage. Also, peptide moieties may be added to the
polypeptide to facilitate purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide moieties to facilitate handling of
polypeptides is a familiar and routine technique in the art.

Moreover, polypeptides of the present invention, including fragments, and
specifically epitopes, can be combined with parts of the constant domain of
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins
facilitate purification and show an increased half-life in vivo. One reported example
describes chimeric proteins consisting of the first two domains of the human
CD4-polypeptide and various domains of the constant regions of the heavy or light chains of
mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84-86
(1988).) Fusion proteins having disulfide-linked dimeric structures (due to the IgG)
can also be more efficient in binding and neutralizing other molecules than the
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J.
Biochem. 270:3958-3964 (1995).)

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if the 
fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D. 
Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, 
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).) 

Thus, any of the above fusions can be engineered using the polynucleotides 
or the polypeptides of the claimed invention. 



Vectors, Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
vector. Retroviral vectors may be replication competent or replication defective. In the 
latter case, viral propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is 
a virus, it may be packaged in vitro using an appropriate packaging cell line and then 
transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to 
name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 
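
The framing requirement described above can be checked mechanically. The following is a minimal sketch, assuming the coding portion is supplied as a plain string; the function name and the use of AUG as the initiating codon are illustrative assumptions and are not taken from the specification, which names only the three termination codons.

```python
# Termination codons named in the text, written in RNA form.
STOP_CODONS = {"UAA", "UGA", "UAG"}
# Assumption: AUG is used as the initiating codon; the text does not name it.
START_CODON = "AUG"


def check_coding_region(seq: str) -> list[str]:
    """Return a list of framing problems found in a candidate coding region."""
    rna = seq.upper().replace("T", "U")  # accept DNA or RNA spelling
    problems = []
    if len(rna) % 3 != 0:
        problems.append("length is not a multiple of three")
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [rna[i:i + 3] for i in range(0, len(rna) - len(rna) % 3, 3)]
    if not codons or codons[0] != START_CODON:
        problems.append("no initiating codon at the beginning")
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no termination codon at the end")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("premature termination codon inside the reading frame")
    return problems
```

An empty list means the insert begins with the assumed initiating codon, ends with one of the three termination codons, and has no earlier in-frame stop.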

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance 
genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 
293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5, available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 
and pSG, available from Stratagene; and pSVK3, pBPV, pMSG and pSVL, available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic Methods 
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 
present invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can also 
be recovered from: products purified from natural sources, including bodily fluids, 
tissues and cells, whether directly isolated or cultured; products of chemical synthetic 
procedures; and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and 
mammalian cells. Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the invention may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this 
prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalently linked. 

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention 
can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 
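
The computer-assisted primer selection mentioned above can be sketched as a simple window scan. This is only an illustration under stated assumptions: the exon coordinates are a hypothetical input (for example, from an exon-prediction step that the specification does not describe), and the function name is invented for this example. Real primer design would also weigh melting temperature and specificity, which are not modeled here.

```python
def primer_candidates(exons, min_len=15, max_len=25):
    """Yield (start, end) windows that lie wholly inside one predicted exon.

    exons: list of (start, end) half-open coordinates on the sequence.
           Hypothetical input; the source does not specify a format.
    min_len, max_len: the preferred 15-25 bp primer length from the text.
    """
    for exon_start, exon_end in exons:
        for length in range(min_len, max_len + 1):
            # Slide a window of this length across the exon without
            # letting it cross the exon boundary.
            for start in range(exon_start, exon_end - length + 1):
                yield (start, start + length)
```

For example, with hypothetical exons `[(0, 20), (20, 30)]`, every window returned falls inside the first exon, and the second exon yields nothing because it is shorter than the minimum primer length.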

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can 
be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow-sorted chromosomes, and preselection by hybridization to construct 
chromosome-specific cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., 
"Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York 
(1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease. (Disease mapping data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library).) Assuming 1 megabase mapping resolution and one gene per 
20 kb, a cDNA precisely localized to a chromosomal region associated with the disease 
could be one of 50-500 potential causative genes. 
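
The lower bound of that estimate follows directly from the two stated assumptions, as the short calculation below shows. The function name is illustrative. The text does not say how the upper figure of 500 is reached; under the same gene density it would correspond to a region of about 10 megabases, which is an inference made here rather than a statement from the specification.

```python
MAPPING_RESOLUTION_BP = 1_000_000  # 1 megabase, as stated in the text
BP_PER_GENE = 20_000               # one gene per 20 kb, as stated in the text


def candidate_genes(region_bp: int, bp_per_gene: int = BP_PER_GENE) -> int:
    """Estimate how many genes fall within a mapped region of the given size."""
    return region_bp // bp_per_gene


print(candidate_genes(MAPPING_RESOLUTION_BP))  # prints 50
```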

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural 
alterations exist, the presence of point mutations is ascertained. Mutations observed in 
some or all affected individuals, but not in normal individuals, indicate that the 
mutation may cause the disease. However, complete sequencing of the polypeptide and 
the corresponding gene from several normal individuals is required to distinguish the 
mutation from a polymorphism. If a new polymorphism is identified, this polymorphic 
polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred 
polynucleotides are usually 20 to 40 bases in length and complementary to either the 
region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids 
Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science 
251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC 
Press, Boca Raton, FL (1988)). Triple helix formation optimally results in a shut-off 
of RNA transcription from DNA, while antisense RNA hybridization blocks translation 
of an mRNA molecule into polypeptide. Both techniques are effective in model 
systems, and the information disclosed herein can be used to design antisense or triple 
helix polynucleotides in an effort to treat disease. 
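
As an illustration of the antisense case only, a complementary RNA oligonucleotide of the preferred 20 to 40 bases can be derived from an mRNA sequence by taking the reverse complement of a chosen window. The sketch below assumes a plain-string mRNA; the function name and the choice of window are hypothetical, and no claim is made about which target regions would be effective.

```python
# Watson-Crick complement for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str, start: int, length: int = 20) -> str:
    """Return the antisense RNA, written 5' to 3', for one window of an mRNA."""
    if not 20 <= length <= 40:
        raise ValueError("length should be 20 to 40 bases, per the text")
    target = mrna.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("window runs past the end of the mRNA")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return target.translate(COMPLEMENT)[::-1]
```

Applying the function twice to the same window returns the original target segment, which is a quick sanity check on the complement table.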

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying 
personnel. This method does not suffer from the current limitations of "Dog Tags" 
which can be lost, switched, or stolen, making positive identification difficult. The 
polynucleotides of the present invention can be used as additional DNA markers for 
RFLP. 
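
The idea behind RFLP can be sketched in a few lines: a sequence difference that creates or destroys a restriction site changes the set of fragment lengths produced by digestion. The example below is a simplified illustration that assumes a linear DNA molecule and uses the EcoRI recognition sequence GAATTC, which is cut after the first G. The function name and the two short sequences are hypothetical and chosen only to show the effect.

```python
def digest(seq: str, site: str = "GAATTC", cut_offset: int = 1) -> list[int]:
    """Return fragment lengths after cutting a linear sequence at every site.

    cut_offset is the position within the site where the strand is cut
    (1 for EcoRI, which cuts between G and AATTC).
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Two hypothetical alleles differing by one base inside the site.
print(digest("AAAGAATTCTTTT"))  # prints [4, 9]
print(digest("AAAGAGTTCTTTT"))  # prints [13]
```

The differing fragment lengths are what appear as distinct bands on a Southern blot.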

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once a unique ID database is established for an individual, 
positive identification of that individual, living or dead, can be made from extremely 
small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to 
identify individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once 
these specific polymorphic loci are amplified, they are digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with 
DNA corresponding to the DQa class II HLA gene. Similarly, polynucleotides of the 
present invention can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of 
unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from the sequences of the present 
invention. Panels of such reagents can identify tissue by species and/or by organ type. 
In a similar fashion, these reagents can be used to screen tissue cultures for 
contamination. 

At the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making oligomers 
for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using 
DNA immunization techniques, and as an antigen to elicit an immune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et 
al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3087-3096 
(1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as glucose oxidase; radioisotopes, such 
as iodine (125I, 121I), carbon (14C), sulfur (35S), tritium (3H), indium (112In), and 
technetium (99mTc); fluorescent labels, such as fluorescein and rhodamine; and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For 
X-radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 131I, 
112In, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In the case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 20 
millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. In 
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson 
Publishing Inc. (1982).) 

Thus, the invention provides a method of diagnosing a disorder, which involves 
(a) assaying the expression of a polypeptide of the present invention in cells or body 
fluid of an individual; and (b) comparing the level of gene expression with a standard gene 
expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of a disorder. 
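
Step (b) amounts to a comparison against a reference value. The sketch below shows one way such a comparison might be expressed. The 20% tolerance band, the function name, and the returned labels are assumptions introduced for illustration; the specification does not state how large a difference from the standard is considered indicative.

```python
def compare_to_standard(assayed: float, standard: float,
                        tolerance: float = 0.2) -> str:
    """Classify an assayed expression level relative to a standard level.

    tolerance is a hypothetical fractional band around the standard
    within which the assayed level is treated as unchanged.
    """
    if standard <= 0:
        raise ValueError("standard level must be positive")
    ratio = assayed / standard
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "within standard range"
```

In practice the band would be set from the variability of the assay and of the reference population rather than fixed in advance.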

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also 
be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such 
as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention could be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from a 
recombinant cell, as a way of assessing transformation of the host cell. Moreover, the 
polypeptides of the present invention can be used to test the following biological 
activities. 

Biological Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 
from pluripotent stem cells. The etiology of these immune deficiencies or disorders 
may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide 
of the present invention can be used as a marker or detector of a particular immune 
system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or 
polynucleotide of the present invention could be used to increase differentiation and 
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to 
treat those disorders associated with a decrease in certain (or many) types of hematopoietic 
cells. Examples of immunologic deficiency syndromes include, but are not limited to: 
blood protein disorders (e.g., agammaglobulinemia, dysgammaglobulinemia), ataxia 
telangiectasia, common variable immunodeficiency, DiGeorge Syndrome, HIV 
infection, HTLV-BLV infection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 
(SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot 
formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g., thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can 
decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of the 
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitus, and autoimmune 
inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polypeptide or 
polynucleotide of the present invention. Moreover, these molecules can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of T- 
cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be 
used to modulate inflammation. For example, the polypeptide or polynucleotide may 
inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), 
ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or inflammation resulting from over production of cytokines (e.g., TNF or 
IL-1). 

Hyperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present 
invention may inhibit the proliferation of the disorder through direct or indirect 
interactions. Alternatively, a polypeptide or polynucleotide of the present invention 
may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
or mobilizing T-cells, hyperproliferative disorders can be treated. This immune 
response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an immune response may 
also be a method of treating hyperproliferative disorders, such as with a chemotherapeutic 
agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: hypergammaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 
Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat or 
detect infectious agents. For example, by increasing the immune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
may be treated. The immune response may be increased by either enhancing an existing 
immune response, or by initiating a new immune response. Alternatively, the 
polypeptide or polynucleotide of the present invention may also directly inhibit the 
infectious agent, without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses include, but are not limited to, the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiolitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox, hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and that 
can be treated or detected by a polynucleotide or polypeptide of the present invention 
include, but are not limited to, the following Gram-negative and Gram-positive bacterial 
families and fungi: Actinomycetales (e.g., Corynebacterium, Mycobacterium, 
Nocardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae, 
Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 
Coccidioidomycosis, Cryptococcosis, Dermatomycoses, Enterobacteriaceae (Klebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, 
Gonorrhea, Meningococcal), Pasteurellaceae infections (e.g., Actinobacillus, 
Haemophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, 
and Staphylococcal. These bacterial or fungal families can cause diseases 
or symptoms including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
related infections), paronychia, prosthesis-related infections, Reiter's Disease, 
respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning, 
Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, Diphtheria, 
Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 
impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases 
(e.g., cellulitis, dermatomycoses), toxemia, urinary tract infections, and wound infections. 
A polypeptide or polynucleotide of the present invention can be used to treat or detect 
any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but are not
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis,
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis,
Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas.
These parasites can cause a variety of diseases or symptoms, including, but not limited
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery,
giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS related),
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide
of the present invention can be used to treat or detect any of these symptoms or 
diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a polynucleotide
of the present invention, and returning the engineered cells to the patient (ex vivo 
therapy). Moreover, the polypeptide or polynucleotide of the present invention can be 
used as an antigen in a vaccine to raise an immune response against infectious disease. 

Regeneration 

A polynucleotide or polypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See,
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair,
replace, or protect tissue damaged by congenital defects, trauma (wounds, burns,
incisions, or ulcers), age, disease (e.g., osteoporosis, osteoarthritis, periodontal
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion 
injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal
or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and
skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs
without scarring or with decreased scarring. Regeneration also may include angiogenesis.

Moreover, a polynucleotide or polypeptide of the present invention may increase
regeneration of tissues difficult to heal. For example, increased tendon/ligament
regeneration would quicken recovery time after damage. A polynucleotide or
polypeptide of the present invention could also be used prophylactically in an effort to
avoid damage. Specific diseases that could be treated include tendinitis, carpal tunnel
syndrome, and other tendon or ligament defects. A further example of tissue
regeneration is the healing of non-healing wounds, including pressure ulcers, ulcers
associated with vascular insufficiency, and surgical and traumatic wounds.

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and
stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease,
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the
present invention. 

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis
activity. A chemotactic molecule attracts or mobilizes cells (e.g., monocytes,
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial
cells) to a particular site in the body, such as a site of inflammation, infection, or
hyperproliferation. The mobilized cells can then fight off and/or heal the particular
trauma or abnormality.

A polynucleotide or polypeptide of the present invention may increase
chemotactic activity of particular cells. These chemotactic molecules can then be used to
treat inflammation, infection, hyperproliferative disorders, or any immune system
disorder by increasing the number of cells targeted to a particular location in the body.
For example, chemotactic molecules can be used to treat wounds and other trauma to
tissues by attracting immune cells to the injured location. Chemotactic molecules of the
present invention can also attract fibroblasts, which can be used to treat wounds.

It is also contemplated that a polynucleotide or polypeptide of the present 
invention may inhibit chemotactic activity. These molecules could also be used to treat
disorders. Thus, a polynucleotide or polypeptide of the present invention could be used
as an inhibitor of chemotaxis. 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that 
bind to the polypeptide or for molecules to which the polypeptide binds. The binding 
of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or
small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et al., Current Protocols in Immunology
1(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli.
Cells expressing the polypeptide (or cell membrane containing the expressed
polypeptide) are then preferably contacted with a test compound potentially containing 
the molecule to observe binding, stimulation, or inhibition of activity of either the 
polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, 
wherein binding is detected by a label, or in an assay involving competition with a 
labeled competitor. Further, the assay may test whether the candidate compound results 
in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a
standard. 
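
The comparison step described above can be illustrated numerically. The following is a minimal sketch only, assuming a competition format in which the signal from a labeled competitor falls as the candidate compound binds. The function names, the background-subtraction step, and the 50% cutoff are hypothetical choices made for illustration and are not values given in this specification.

```python
def percent_displacement(signal_with_candidate, signal_standard, background=0.0):
    """Percent loss of labeled-competitor signal relative to a standard.

    signal_standard: signal from the labeled competitor alone (no candidate).
    signal_with_candidate: signal measured with the candidate compound present.
    background: non-specific signal subtracted from both readings (assumed).
    """
    reference = signal_standard - background
    if reference <= 0:
        raise ValueError("standard signal must exceed background")
    specific = signal_with_candidate - background
    return 100.0 * (1.0 - specific / reference)


def is_candidate_binder(signal_with_candidate, signal_standard,
                        background=0.0, cutoff_percent=50.0):
    # cutoff_percent is an arbitrary illustrative threshold, not a value
    # taken from the specification.
    displaced = percent_displacement(
        signal_with_candidate, signal_standard, background)
    return displaced >= cutoff_percent
```

In practice the standard, the background correction, and any cutoff would be set by the particular assay format chosen.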

Preferably, an ELISA assay can measure polypeptide level or activity in a
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The
antibody can measure polypeptide level or activity by either binding, directly or
indirectly, to the polypeptide or by competing with the polypeptide for a substrate.

All of these above assays can be used as diagnostic or prognostic markers. The
molecules discovered using these assays can be used to treat disease or to bring about a
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or
enhance the production of the polypeptide from suitably manipulated cells or tissues.

Therefore, the invention includes a method of identifying compounds which
bind to a polypeptide of the invention comprising the steps of: (a) incubating a
candidate binding compound with a polypeptide of the invention; and (b) determining if
binding has occurred. Moreover, the invention includes a method of identifying
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with
a polypeptide of the invention, (b) assaying a biological activity, and (c) determining if
a biological activity of the polypeptide has been altered.

Other Activities

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as
discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to
modulate mammalian characteristics, such as body height, weight, hair color, eye color,
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be
used to modulate mammalian metabolism affecting catabolism, anabolism, processing,
utilization, and storage of energy.

A polypeptide or polynucleotide of the present invention may be used to change
a mammal's mental state or physical state by influencing biorhythms, circadian
rhythms, depression (including depressive disorders), tendency for violence, tolerance
for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity),
hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive
qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional
components.

Other Preferred Embodiments

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of 
SEQ ID NO:X wherein X is any integer as defined in Table 1.
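
The "at least 95% identical over at least about 50 contiguous nucleotides" criterion can be illustrated with a simple sliding-window comparison. This is a simplified, ungapped sketch and an assumption made for illustration; it does not reproduce whatever alignment method the specification uses elsewhere to define percent identity. The function names and default parameters are hypothetical.

```python
def best_window_identity(query, reference, window=50):
    """Highest ungapped percent identity between any `window`-long stretch
    of `reference` and any equally long stretch of `query`."""
    query, reference = query.upper(), reference.upper()
    if len(query) < window or len(reference) < window:
        return 0.0
    best = 0.0
    for i in range(len(reference) - window + 1):
        ref_win = reference[i:i + window]
        for j in range(len(query) - window + 1):
            # Count positions where the two windows carry the same base.
            matches = sum(a == b for a, b in zip(ref_win, query[j:j + window]))
            best = max(best, 100.0 * matches / window)
    return best


def meets_identity_threshold(query, reference, window=50, threshold=95.0):
    # True if some window of the query is at least `threshold` percent
    # identical to some window of the reference (ungapped comparison only).
    return best_window_identity(query, reference, window) >= threshold
```

The brute-force double loop is slow for long sequences and ignores insertions and deletions, so a real comparison would normally use an established alignment tool.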

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Clone Sequence and ending with the nucleotide at about the position of the 3'
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of
the Clone Sequence as defined for SEQ ID NO:X in Table 1.

Similarly preferred is a nucleic acid molecule wherein said sequence of
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the
range of positions beginning with the nucleotide at about the position of the 5'
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide
at about the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID 
NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a sequence of at least about 150 contiguous
nucleotides in the nucleotide sequence of SEQ ID NO:X.

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of the
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the
position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in
Table 1.

A further preferred embodiment is an isolated nucleic acid molecule comprising
a nucleotide sequence which is at least 95% identical to the complete nucleotide
sequence of SEQ ID NO:X.

Also preferred is an isolated nucleic acid molecule which hybridizes under
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions 
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or 
of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1,
which DNA molecule is contained in the material deposited with the American Type
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the
ATCC Deposit Number shown in Table 1.

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete
open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 150 contiguous
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to a sequence of at least 500
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising
a nucleotide sequence which is at least 95% identical to the complete nucleotide
sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical
to a sequence of at least 50 contiguous nucleotides in a sequence selected from the
group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer
as defined in Table 1; and a nucleotide sequence encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing a nucleotide sequence of at least one nucleic acid
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said nucleic acid molecule in said sample is at least 95%
identical to said selected sequence.

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid
molecules in said sample and a nucleic acid molecule comprising said sequence selected
from said group. Similarly, also preferred is the above method wherein said step of
comparing sequences is performed by comparing the nucleotide sequence determined
from a nucleic acid molecule in said sample with said sequence selected from said
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or 
cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any
integer as defined in Table 1; and a nucleotide sequence encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

The method for identifying the species, tissue or cell type of a biological sample
can comprise a step of detecting nucleic acid molecules comprising a nucleotide
sequence in a panel of at least two nucleotide sequences, wherein at least one sequence 
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition
associated with abnormal structure or expression of a gene encoding a secreted protein
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide
sequence that is at least 95% identical to a sequence of at least 50 contiguous
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence
of SEQ ID NO:X wherein X is any integer as defined in Table 1; and a nucleotide
sequence encoded by a human cDNA clone identified by a cDNA Clone Identifier in
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said
cDNA clone in Table 1.

The method for diagnosing a pathological condition can comprise a step of
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group.

Also preferred is a composition of matter comprising isolated nucleic acid
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is
at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein
X is any integer as defined in Table 1; and a nucleotide sequence encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of positions
beginning with the residue at about the position of the First Amino Acid of the Secreted
Portion and ending with the residue at about the Last Amino Acid of the Open Reading
Frame as set forth for SEQ ID NO:Y in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to the complete amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in
Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 95% identical to a sequence of at least about 100 contiguous amino acids in the
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table 1. 

Further preferred is an isolated antibody which binds specifically to a 
polypeptide comprising an amino acid sequence that is at least 90% identical to a
sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a protein encoded by a
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in
the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing an amino acid sequence of at least one polypeptide
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said polypeptide molecule in said sample is at least 90% 
identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino
acid sequence of at least one polypeptide molecule in said sample with a sequence
selected from said group comprises determining the extent of specific binding of
polypeptides in said sample to an antibody which binds specifically to a polypeptide
comprising an amino acid sequence that is at least 90% identical to a sequence of at least
10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method wherein said step of comparing sequences is
performed by comparing the amino acid sequence determined from a polypeptide
molecule in said sample with said sequence selected from said group.

Also preferred is a method for identifying the species, tissue or cell type of a
biological sample which method comprises a step of detecting polypeptide molecules in
said sample, if any, comprising an amino acid sequence that is at least 90% identical to
a sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a secreted protein encoded
by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method for identifying the species, tissue or cell type
of a biological sample, which method comprises a step of detecting polypeptide
molecules comprising an amino acid sequence in a panel of at least two amino acid
sequences, wherein at least one sequence in said panel is at least 90% identical to a
sequence of at least 10 contiguous amino acids in a sequence selected from the above
group. 

Also preferred is a method for diagnosing in a subject a pathological condition
associated with abnormal structure or expression of a gene encoding a secreted protein
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject polypeptide molecules comprising an amino acid sequence in
a panel of at least two amino acid sequences, wherein at least one sequence in said panel
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid
sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number
shown for said cDNA clone in Table 1.

In any of these methods, the step of detecting said polypeptide molecules
includes using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a nucleotide sequence encoding a
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least
90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is
any integer as defined in Table 1; and a complete amino acid sequence of a secreted
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table
1 and contained in the deposit with the ATCC Deposit Number shown for said cDNA
clone in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide 
in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide
comprises an amino acid sequence selected from the group consisting of: an amino acid
sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method of making a recombinant vector comprising
inserting any of the above isolated nucleic acid molecules into a vector. Also preferred is
the recombinant vector produced by this method. Also preferred is a method of making
a recombinant host cell comprising introducing the vector into a host cell, as well as the
recombinant host cell produced by this method.

Also preferred is a method of making an isolated polypeptide comprising
culturing this recombinant host cell under conditions such that said polypeptide is
expressed and recovering said polypeptide. Also preferred is this method of making an
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said
polypeptide is a secreted portion of a human secreted protein comprising an amino acid
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y beginning with the residue at the position of the First Amino Acid of the Secreted
Portion of SEQ ID NO:Y wherein Y is an integer set forth in Table 1 and said position
of the First Amino Acid of the Secreted Portion of SEQ ID NO:Y is defined in Table 1;
and an amino acid sequence of a secreted portion of a protein encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The
isolated polypeptide produced by this method is also preferred.
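
The relationship between a full open reading frame and its secreted portion can be shown with a short sketch. It assumes only that the position of the First Amino Acid of the Secreted Portion is a 1-based residue number, as is conventional for sequence tables. The function name and the example sequence are hypothetical and are not taken from Table 1.

```python
def secreted_portion(orf_sequence, first_secreted_position):
    """Return the residues from the 1-based `first_secreted_position`
    through the last amino acid of the open reading frame."""
    if not 1 <= first_secreted_position <= len(orf_sequence):
        raise ValueError("position lies outside the open reading frame")
    # Convert the 1-based residue number to a 0-based slice start.
    return orf_sequence[first_secreted_position - 1:]


# Made-up 16-residue example: residues 1-10 stand in for a signal peptide.
example_orf = "MKTLLLAVAVGAQPEK"
mature = secreted_portion(example_orf, 11)
```

With the made-up sequence above, `mature` holds the six residues that follow the first ten.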

Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated
polypeptide, polynucleotide, or antibody of the claimed invention effective to increase
the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily
understood by reference to the following examples, which are provided by way of
illustration and are not intended as limiting.



Examples

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sample

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector.
Table 1 identifies the vectors used to construct the cDNA library from which each clone
was isolated. In many cases, the vector used to construct the library is a phage vector
from which a plasmid has been excised. The table immediately below correlates the
related plasmid for each phage vector used in constructing the cDNA library. For
example, where a particular clone is identified in Table 1 as being isolated in the vector
"Lambda Zap," the corresponding deposited clone is in "pBluescript."

Vector Used to Construct Library     Corresponding Deposited Plasmid

Lambda Zap                           pBluescript (pBS)
Uni-Zap XR                           pBluescript (pBS)
Zap Express                          pBK
lafmid BA                            plafmid BA
pSport1                              pSport1
pCMVSport 2.0                        pCMVSport 2.0
pCMVSport 3.0                        pCMVSport 3.0
pCR®2.1                              pCR®2.1

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap
XR (U.S. Patent Nos. 5,128,256 and 5,286,636), Zap Express (U.S. Patent Nos.
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res.
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res.
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are
commercially available from Stratagene Cloning Systems, Inc., 11011 N. Torrey Pines
Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and pBK
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1
Blue, also available from Stratagene. pBS comes in 4 forms: SK+, SK-, KS+ and KS-.
The S and K refer to the orientation of the polylinker to the T7 and T3 primer
sequences which flank the polylinker region ("S" is for SacI and "K" is for KpnI, which
are the first sites on each respective end of the linker). "+" or "-" refers to the orientation
of the f1 origin of replication ("ori"), such that in one orientation, single stranded rescue
initiated from the f1 ori generates sense strand DNA and in the other, antisense.

Vectors pSport1, pCMVSport 2.0 and pCMVSport 3.0 were obtained from
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors
contain an ampicillin resistance gene and may be transformed into E. coli strain
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, NY)
contains an ampicillin resistance gene and can be transformed into E. coli strain XL-1
Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue,
Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed
into E. coli strain DH10B, available from Life Technologies. (See, for instance, Clark,
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9:
(1991).) Preferably, a polynucleotide of the present invention does not comprise the
phage vector sequences identified for the particular clone in Table 1, as well as the
corresponding plasmid vector sequences designated above.

The deposited material in the sample assigned the ATCC Deposit Number cited
in Table 1 for any given cDNA clone also may contain one or more additional plasmids,
each comprising a cDNA clone different from that given clone. Thus, deposits sharing
the same ATCC Deposit Number contain at least a plasmid for each cDNA clone
identified in Table 1. Typically, each ATCC deposit sample cited in Table 1 comprises
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each
containing a different cDNA clone; but such a deposit sample may include plasmids for
more or less than 50 cDNA clones, up to about 500 cDNA clones.

Two approaches can be used to isolate a particular clone from the deposited
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly
isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID
NO:X.

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized
using an Applied Biosystems DNA synthesizer according to the sequence reported.
The oligonucleotide is labeled, for instance, with 32P-γ-ATP using T4 polynucleotide
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).)
The plasmid mixture is transformed into a suitable host, as indicated above (such as
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as
those provided by the vector supplier or in related publications or patents cited above.
The transformants are plated on 1.5% agar plates (containing the appropriate selection
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate.
These plates are screened using Nylon membranes according to routine methods for
bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to
1.104), or other techniques known to those of skill in the art.
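As a rough guide to how many colonies need to be screened, the sketch below assumes an idealized deposit in which all plasmids are equally represented and each transformant carries exactly one plasmid drawn independently. Both assumptions, and the function names, are illustrative and not part of the protocol above.

```python
import math


def prob_clone_present(n_plasmids: int, n_colonies: int) -> float:
    """Probability that at least one screened colony carries the desired clone.

    Assumes each colony independently carries any one of n_plasmids with
    equal probability (an idealization of the deposited mixture).
    """
    return 1.0 - (1.0 - 1.0 / n_plasmids) ** n_colonies


def colonies_needed(n_plasmids: int, confidence: float) -> int:
    """Smallest number of colonies that reaches the requested probability."""
    miss_per_colony = 1.0 - 1.0 / n_plasmids
    return math.ceil(math.log(1.0 - confidence) / math.log(miss_per_colony))


if __name__ == "__main__":
    # One plate at about 150 colonies, for mixtures of 50 and 500 plasmids.
    for n in (50, 500):
        print(n, round(prob_clone_present(n, 150), 3), colonies_needed(n, 0.99))
```

Under these assumptions a single plate of about 150 colonies is very likely to contain the desired clone when the mixture holds about 50 plasmids, whereas a mixture approaching 500 plasmids would call for many more plates.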

Alternatively, two primers of 17-20 nucleotides derived from both ends of the
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired
cDNA using the deposited cDNA plasmid as a template. The polymerase chain reaction
is carried out under routine conditions, for instance, in 25 µl of reaction mixture with
0.5 µg of the above cDNA template. A convenient reaction mixture is 1.5-5 mM
MgCl2, 0.01% (w/v) gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation
at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are
performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product
is analyzed by agarose gel electrophoresis and the DNA band with expected molecular
weight is excised and purified. The PCR product is verified to be the selected sequence
by subcloning and sequencing the DNA product.
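The primer derivation described above can be sketched as follows. This is a minimal illustration that assumes the 5' NT and 3' NT positions are 1-based and inclusive; the function names are hypothetical, and a real design would also check melting temperature and secondary structure, which are not modeled here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def end_primers(seq: str, five_nt: int, three_nt: int, length: int = 20):
    """Derive a forward and a reverse primer from both ends of a clone region.

    five_nt and three_nt are taken as 1-based inclusive positions within seq
    (an assumption about how the Table 1 coordinates are counted).
    """
    if not 17 <= length <= 20:
        raise ValueError("primer length should be 17-20 nucleotides")
    region = seq.upper()[five_nt - 1:three_nt]
    if len(region) < 2 * length:
        raise ValueError("region too short for two non-overlapping primers")
    forward = region[:length]                       # same sense as the clone
    reverse = reverse_complement(region[-length:])  # anneals to the 3' end
    return forward, reverse


if __name__ == "__main__":
    toy_sequence = "ATGC" * 15  # placeholder, not a real SEQ ID NO:X
    print(end_primers(toy_sequence, 1, 60, 20))
```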

Several methods are available for the identification of the 5' or 3' non-coding
portions of a gene which may not be present in the deposited clone. These methods
include, but are not limited to, filter probing, clone enrichment using specific probes,
and protocols similar or identical to 5' and 3' "RACE" protocols which are well known
in the art. For instance, a method similar to 5' RACE is available for generating the
missing 5' end of a desired full-length transcript. (Fromont-Racine et al., Nucleic Acids
Res. 21(7):1683-1684 (1993).)

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population
of RNA presumably containing full-length gene RNA transcripts. A primer set
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to
a known sequence of the gene of interest is used to PCR amplify the 5' portion of the
desired full-length gene. This amplified product may then be sequenced and used to
generate the full length gene.

This above method starts with total RNA isolated from the desired source,
although poly-A+ RNA can be used. The RNA preparation can then be treated with
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged
RNA which may interfere with the later RNA ligase step. The phosphatase should then
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to
remove the cap structure present at the 5' ends of messenger RNAs. This reaction
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be
ligated to an RNA oligonucleotide using T4 RNA ligase.

This modified RNA preparation is used as a template for first strand cDNA
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is
used as a template for PCR amplification of the desired 5' end using a primer specific to
the ligated RNA oligonucleotide and a primer specific to the known sequence of the
gene of interest. The resultant product is then sequenced and analyzed to confirm that
the 5' end sequence belongs to the desired gene.

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide

A human genomic P1 library (Genomic Systems, Inc.) is screened by PCR
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X,
according to the method described in Example 1. (See also, Sambrook.)

Example 3: Tissue Distribution of Polypeptide

Tissue distribution of mRNA expression of polynucleotides of the present
invention is determined using protocols for Northern blot analysis, described by,
among others, Sambrook et al. For example, a cDNA probe produced by the method
described in Example 1 is labeled with 32P using the rediprime™ DNA labeling system
(Amersham Life Science), according to manufacturer's instructions. After labeling, the
probe is purified using CHROMA SPIN-100™ column (Clontech Laboratories, Inc.),
according to manufacturer's protocol number PT1200-1. The purified labeled probe is
then used to examine various human tissues for mRNA expression.

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or
human immune system tissues (IM) (Clontech) are examined with the labeled probe
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's
protocol number PT1190-1. Following hybridization and washing, the blots are
mounted and exposed to film at -70°C overnight, and the films developed according to
standard procedures.

Example 4: Chromosomal Mapping of the Polynucleotides

An oligonucleotide primer set is designed according to the sequence at the 5'
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This
primer set is then used in a polymerase chain reaction under the following set of
conditions: 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is repeated
32 times followed by one 5 minute cycle at 70°C. Human, mouse, and hamster DNA
is used as template in addition to a somatic cell hybrid panel containing individual
chromosomes or chromosome fragments (Bios, Inc). The reactions are analyzed on
either 8% polyacrylamide gels or 3.5% agarose gels. Chromosome mapping is
determined by the presence of an approximately 100 bp PCR fragment in the particular
somatic cell hybrid.
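The scoring step can be sketched as a simple lookup over the panel. The data layout (a mapping from each hybrid's chromosome label to the fragment sizes seen on the gel) and the size tolerance are assumptions made for illustration only.

```python
def assign_chromosomes(panel, expected_bp=100, tolerance_bp=15):
    """Return the panel labels whose lanes show a band near expected_bp.

    panel maps a hybrid label (e.g. "chr7") to a list of observed fragment
    sizes in base pairs; tolerance_bp is an illustrative gel-reading margin.
    """
    return sorted(
        label
        for label, sizes in panel.items()
        if any(abs(size - expected_bp) <= tolerance_bp for size in sizes)
    )


if __name__ == "__main__":
    # Hypothetical gel readout: only the chr7 hybrid shows a ~100 bp band.
    readout = {"chr1": [], "chr7": [102], "chr12": [350], "hamster": []}
    print(assign_chromosomes(readout))
```

A band of the expected size in the mouse or hamster control lanes would indicate a non-specific product, so those lanes are worth including in the readout.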

Example 5: Bacterial Expression of a Polypeptide

A polynucleotide encoding a polypeptide of the present invention is amplified
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers
used to amplify the cDNA insert should preferably contain restriction sites, such as
BamHI and XbaI, at the 5' end of the primers in order to clone the amplified product
into the expression vector. For example, BamHI and XbaI correspond to the restriction
enzyme sites on the bacterial expression vector pQE-9 (Qiagen, Inc., Chatsworth,
CA). This plasmid vector encodes antibiotic resistance (Ampr), a bacterial origin of
replication (ori), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site
(RBS), a 6-histidine tag (6-His), and restriction enzyme cloning sites.
The pQE-9 vector is digested with BamHI and XbaI and the amplified fragment
is ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial
RBS. The ligation mixture is then used to transform the E. coli strain M15/rep4
(Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which expresses
the lacI repressor and also confers kanamycin resistance (Kanr). Transformants are
identified by their ability to grow on LB plates and ampicillin/kanamycin resistant
colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis.

Clones containing the desired constructs are grown overnight (O/N) in liquid
culture in LB media supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml).
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The
cells are grown to an optical density 600 (O.D.600) of between 0.4 and 0.6. IPTG
(isopropyl-β-D-thiogalactopyranoside) is then added to a final concentration of 1 mM.
IPTG induces by inactivating the lacI repressor, clearing the P/O leading to increased
gene expression.
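The inoculation ratio and the IPTG addition reduce to simple dilution arithmetic, sketched below. The 1 M IPTG stock in the example is an assumption (the text specifies only the 1 mM final concentration), and the small volume change on adding the stock is ignored.

```python
def inoculum_range_ml(culture_volume_ml: float):
    """Overnight-culture volume for a 1:250 to 1:100 inoculation ratio."""
    return culture_volume_ml / 250, culture_volume_ml / 100


def stock_volume_ml(final_conc_mm: float, culture_volume_ml: float,
                    stock_conc_mm: float) -> float:
    """Volume of stock needed, from C1 * V1 = C2 * V2.

    Ignores the slight increase in culture volume caused by the addition.
    """
    return final_conc_mm * culture_volume_ml / stock_conc_mm


if __name__ == "__main__":
    volume = 1000.0  # ml of large culture (illustrative)
    print(inoculum_range_ml(volume))
    # Assumed 1 M (1000 mM) IPTG stock, 1 mM final concentration.
    print(stock_volume_ml(1.0, volume, 1000.0))
```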

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by
centrifugation (20 mins at 6000 x g). The cell pellet is solubilized in the chaotropic
agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4°C. The cell debris is
removed by centrifugation, and the supernatant containing the polypeptide is loaded
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high
affinity and can be purified in a simple one-step procedure (for details see: The
QIAexpressionist (1995) QIAGEN, Inc., supra).

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8;
the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed
with 10 volumes of 6 M guanidine-HCl, pH 6, and finally the polypeptide is eluted with
6 M guanidine-HCl, pH 5.

The purified protein is then renatured by dialyzing it against phosphate-buffered
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the
protein can be successfully refolded while immobilized on the Ni-NTA column. The
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in
500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors.
The renaturation should be performed over a period of 1.5 hours or more. After
renaturation the proteins are eluted by the addition of 250 mM imidazole. Imidazole
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer
plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80°C.

In addition to the above expression vector, the present invention further includes
an expression vector comprising phage operator and promoter elements operatively
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession
Number XXXXXX.) This vector contains: 1) a neomycinphosphotransferase gene as
a selection marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence,
4) two lac operator sequences, 5) a Shine-Dalgarno sequence, and 6) the lactose operon
repressor gene (lacIq). The origin of replication (oriC) is derived from pUC19 (LTI,
Gaithersburg, MD). The promoter sequence and operator sequences are made
synthetically.

DNA can be inserted into the pHE4a by restricting the vector with NdeI and
XbaI, BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA
insert is generated according to the PCR protocol described in Example 1, using PCR
primers having restriction sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible
enzymes. The insert and vector are ligated according to standard protocols.

The engineered vector could easily be substituted in the above protocol to
express protein in a bacterial system.
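Adding restriction sites to the 5' ends of the PCR primers, as described above, can be sketched as follows. The recognition sequences are the standard ones for the named enzymes; the short 5' "clamp" of extra bases and the function names are assumptions for illustration and are not specified in the text.

```python
# Standard recognition sequences for the enzymes named above (all palindromic).
SITES = {
    "NdeI": "CATATG",
    "XbaI": "TCTAGA",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "Asp718": "GGTACC",
}


def tailed_primer(gene_specific: str, enzyme: str, clamp: str = "GCGC") -> str:
    """Prepend a clamp and a restriction site to a gene-specific primer.

    The clamp (extra 5' bases to aid digestion near a fragment end) is an
    illustrative choice. The site is simply prepended; to reuse the ATG
    inside the NdeI site as the start codon, pass a gene-specific sequence
    that begins after the ATG.
    """
    return clamp + SITES[enzyme] + gene_specific.upper()


def has_internal_site(insert: str, enzyme: str) -> bool:
    """True if the insert itself contains the enzyme's recognition site.

    Because the sites are palindromic, scanning one strand is sufficient.
    """
    return SITES[enzyme] in insert.upper()


if __name__ == "__main__":
    print(tailed_primer("atgaaagcactg", "NdeI"))   # hypothetical 5' primer
    print(has_internal_site("AAGGATCCTT", "BamHI"))
```

Checking the insert for internal sites before choosing the 3' enzyme avoids cutting the amplified fragment during the digestion step.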

Example 6: Purification of a Polypeptide from an Inclusion Body

The following alternative method can be used to purify a polypeptide expressed
in E. coli when it is present in the form of inclusion bodies. Unless otherwise specified,
all of the following steps are conducted at 4-10°C.

Upon completion of the production phase of the E. coli fermentation, the cell
culture is cooled to 4-10°C and the cells harvested by continuous centrifugation at
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit
weight of cell paste and the amount of purified protein required, an appropriate amount
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50
mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a
high shear mixer.

The cells are then lysed by passing the solution through a microfluidizer
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C
overnight to allow further GuHCl extraction.

Following high speed centrifugation (30,000 x g) to remove insoluble particles,
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing
for 12 hours prior to further purification steps.
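The effect of the rapid dilution on the denaturant concentration can be estimated with a one-line calculation. The sketch assumes the refolding buffer contains no GuHCl and that volumes are additive; the function name is illustrative.

```python
def diluted_conc(initial_conc_m: float, volumes_of_diluent: float) -> float:
    """Concentration after mixing 1 volume of sample with N volumes of diluent.

    Assumes the diluent contains none of the solute and volumes are additive.
    """
    return initial_conc_m / (1.0 + volumes_of_diluent)


if __name__ == "__main__":
    # 1.5 M GuHCl extract mixed with 20 volumes of refolding buffer.
    print(f"{diluted_conc(1.5, 20):.3f} M GuHCl after dilution")
```

Under these assumptions the GuHCl concentration drops about 21-fold on mixing, to roughly 0.07 M.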

To clarify the refolded polypeptide solution, a previously prepared tangential
filtration unit equipped with 0.16 µm membrane filter with appropriate surface area
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a
stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored.
Fractions are collected and further analyzed by SDS-PAGE.

Fractions containing the polypeptide are then pooled and mixed with 4 volumes
of water. The diluted sample is then loaded onto a previously prepared set of tandem
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant
A280 monitoring of the effluent. Fractions containing the polypeptide (determined, for
instance, by 16% SDS-PAGE) are then pooled.

The resultant polypeptide should exhibit greater than 95% purity after the above
refolding and purification steps. No major contaminant bands should be observed from
Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is loaded.
The purified protein can also be tested for endotoxin/LPS contamination, and typically
the LPS content is less than 0.1 ng/ml according to LAL assays.

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus
Expression System

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide
into a baculovirus to express a polypeptide. This expression vector contains the strong
polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus
(AcMNPV) followed by convenient restriction sites such as BamHI, Xba I and
Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient
polyadenylation. For easy selection of recombinant virus, the plasmid contains the
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter in the
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The
inserted genes are flanked on both sides by viral sequences for cell-mediated
homologous recombination with wild-type viral DNA to generate a viable virus that
expresses the cloned polynucleotide.

Many other baculovirus vectors can be used in place of the vector above, such
as pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, as
long as the construct provides appropriately located signals for transcription,
translation, secretion and the like, including a signal peptide and an in-frame AUG as
required. Such vectors are described, for instance, in Luckow et al., Virology 170:31-
39 (1989).

Specifically, the cDNA sequence contained in the deposited clone, including the
AUG initiation codon and the naturally associated leader sequence identified in Table 1,
is amplified using the PCR protocol described in Example 1. If the naturally occurring
signal sequence is used to produce the secreted protein, the pA2 vector does not need a
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a
baculovirus leader sequence, using the standard methods described in Summers et al.,
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures,"
Texas Agricultural Experimental Station Bulletin No. 1555 (1987).

The amplified fragment is isolated from a 1% agarose gel using a commercially
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested
with appropriate restriction enzymes and again purified on a 1% agarose gel.

The plasmid is digested with the corresponding restriction enzymes and,
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.).

The fragment and the dephosphorylated plasmid are ligated together with T4
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation
mixture and spread on culture plates. Bacteria containing the plasmid are identified by
digesting DNA from individual colonies and analyzing the digestion product by gel
electrophoresis. The sequence of the cloned fragment is confirmed by DNA
sequencing.

Five µg of a plasmid containing the polynucleotide is co-transfected with 1.0 µg
of a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus
DNA", Pharmingen, San Diego, CA), using the lipofection method described by
Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One µg of
BaculoGold™ virus DNA and 5 µg of the plasmid are mixed in a sterile well of a
microtiter plate containing 50 µl of serum-free Grace's medium (Life Technologies
Inc., Gaithersburg, MD). Afterwards, 10 µl Lipofectin plus 90 µl Grace's medium are
added, mixed and incubated for 15 minutes at room temperature. Then the transfection
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm
tissue culture plate with 1 ml Grace's medium without serum. The plate is then
incubated for 5 hours at 27°C. The transfection solution is then removed from the plate
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added.
Cultivation is then continued at 27°C for four days.

After four days the supernatant is collected and a plaque assay is performed, as
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a
"plaque assay" of this type can also be found in the user's guide for insect cell culture
and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.)
After appropriate incubation, blue stained plaques are picked with the tip of a
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then
resuspended in a microcentrifuge tube containing 200 µl of Grace's medium and the
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in
35 mm dishes. Four days later the supernatants of these culture dishes are harvested
and then they are stored at 4°C.

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's
medium supplemented with 10% heat-inactivated FBS. The cells are infected with the
recombinant baculovirus containing the polynucleotide at a multiplicity of infection
("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is
removed and is replaced with SF900 II medium minus methionine and cysteine
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 µCi of 35S-
methionine and 5 µCi 35S-cysteine (available from Amersham) are added. The cells are
further incubated for 16 hours and then are harvested by centrifugation. The proteins in
the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE
followed by autoradiography (if radiolabeled).
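The volume of virus stock needed to reach a given multiplicity of infection follows from the definition MOI = infectious particles / cells. The cell count and titer in the sketch below are illustrative values, not figures from the text, and the function name is hypothetical.

```python
def virus_volume_ml(moi: float, n_cells: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock that delivers `moi` plaque-forming units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * n_cells / titer_pfu_per_ml


if __name__ == "__main__":
    # Illustrative: 1e6 Sf9 cells, stock titer 1e8 pfu/ml, MOI of about 2.
    print(f"{virus_volume_ml(2, 1e6, 1e8) * 1000:.0f} µl of virus stock")
```

The titer itself would come from the plaque assay described earlier in this example.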

Microsequencing of the amino acid sequence of the amino terminus of purified
protein may be used to determine the amino terminal sequence of the produced protein.

Example 8: Expression of a Polypeptide in Mammalian Cells

The polypeptide of the present invention can be expressed in a mammalian cell.
A typical mammalian expression vector contains a promoter element, which mediates
the initiation of transcription of mRNA, a protein coding sequence, and signals required
for the termination of transcription and polyadenylation of the transcript. Additional
elements include enhancers, Kozak sequences and intervening sequences flanked by
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved
with the early and late promoters from SV40, the long terminal repeats (LTRs) from
Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus
(CMV). However, cellular elements can also be used (e.g., the human actin promoter).

Suitable expression vectors for use in practicing the present invention include,
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden),
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109),
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used
include human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1,
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO)
cells.

Alternatively, the polypeptide can be expressed in stable cell lines containing the
polynucleotide integrated into a chromosome. The co-transfection with a selectable
marker such as dhfr, gpt, neomycin, or hygromycin allows the identification and isolation
of the transfected cells.

The transfected gene can also be amplified to express large amounts of the
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing
cell lines that carry several hundred or even several thousand copies of the gene of
interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin,
J. L. and Ma, C., Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is
the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991);
Bebbington et al., Bio/Technology 10:169-175 (1992)). Using these markers, the
mammalian cells are grown in selective medium and the cells with the highest resistance
are selected. These cell lines contain the amplified gene(s) integrated into a
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the
production of proteins.

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et
al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985)). Multiple cloning sites, e.g.,
with the restriction enzyme cleavage sites BamHI, XbaI and Asp718, facilitate the
cloning of the gene of interest. The vectors also contain the 3' intron, the
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse
DHFR gene under control of the SV40 early promoter.

Specifically, the plasmid pC6, for example, is digested with appropriate
restriction enzymes and then dephosphorylated using calf intestinal phosphatase by
procedures known in the art. The vector is then isolated from a 1% agarose gel.

A polynucleotide of the present invention is amplified according to the protocol
outlined in Example 1. If the naturally occurring signal sequence is used to produce the
secreted protein, the vector does not need a second signal peptide. Alternatively, if the
naturally occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. (See, e.g., WO 96/34891.)

The amplified fragment is isolated from a 1% agarose gel using a commercially
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested
with appropriate restriction enzymes and again purified on a 1% agarose gel.

The amplified fragment is then digested with the same restriction enzyme and
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector
are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then
transformed and bacteria are identified that contain the fragment inserted into plasmid
pC6 using, for instance, restriction enzyme analysis.

Chinese hamster ovary cells lacking an active DHFR gene are used for
transfection. Five µg of the expression plasmid pC6 is cotransfected with 0.5 µg of the
plasmid pSV2-neo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that
confers resistance to a group of antibiotics including G418. The cells are seeded in
alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus
MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418.
After about 10-14 days single clones are trypsinized and then seeded in 6-well petri
dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM,
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of
methotrexate are then transferred to new 6-well plates containing even higher
concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same
procedure is repeated until clones are obtained which grow at a concentration of 100 -
200 µM. Expression of the desired gene product is analyzed, for instance, by SDS-
PAGE and Western blot or by reversed phase HPLC analysis.

Example 9: Protein Fusions 

The polypeptides of the present invention are preferably fused to other proteins. 
These fusion proteins can be used for a variety of applications. For example, fusion of 
the present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose 
binding protein facilitates purification. (See Example 5; see also EP A 394,827; 
Traunecker et al., Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and 
albumin increases the half-life in vivo. Nuclear localization signals fused to the 
polypeptides of the present invention can target the protein to a specific subcellular 
localization, while covalent heterodimers or homodimers can increase or decrease the 
activity of a fusion protein. Fusion proteins can also create chimeric molecules having 
more than one function. Finally, fusion proteins can increase solubility and/or stability 
of the fused protein compared to the non-fused protein. All of the types of fusion 
proteins described above can be made by modifying the following protocol, which 
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in 
Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using 
primers that span the 5' and 3' ends of the sequence described below. These primers 
also should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can 
be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note that 
the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not 
be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGGGCAAATCITCrGACAAAACrGAGACATCCCCACCGrGCG 

CAGCAOCTGAAITCGAGGGTGCACCGTCAGTdTCCTCrTCCCCCCAAAACC 

CAAGGACACGCTGATGATCTCCCGGACrCCTCAGGTCACATCGGrcGTGGT 

GGACGTAAGCCACGAAGACCCTGAGGTCAAGTrcAACTGGTACGTGGACG 

GCGTGGAGGTGCATAATGCGAAGACAAAGCCGCGGGAGGAGCAGTACAAC 

AGCAGGTACCGTGTGGTCAGGGTCGTGAGGGTCCTGGACCAGGACTGGCrG 

AATGGCAAGGAGTACAAGTGCAAGGTCrCCAACAAAGCCCrCGCAAGCCGC 

ATGGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGCn- 

GTACACCCTGCCCCCATCCCGGGATGAGCrGACCAAGAACCAGGTCAGCCr 

GACCTGCCTGGTCAAAGGCTTCrATGCAAGCGACATGGCCGTGGAGTGGGA 

GAGCAATGGGCAGGCGGAGAACAACTACAAGACCACGCCTGGCGTGCTGG 

AGTGCGAGGGCrCdTCrrCCTCrACAGCAAGCrCACCCTGGACAAGAGCA 

GGTGGCAGGAGGGGAACGTCITCrCATGCrCCGTGATGCATGAGGCrcrGC 

ACAACCACTACAGGCAGAAGAGCCrGTCCCrGTCrCCGdGTAAATGAGTGC 
GACGGCCGGGACrCTAGAGGAT (SEQ ID NO: 1) 
Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of 
the present invention are administered to an animal to induce the production of sera 
containing polyclonal antibodies. In a preferred method, a preparation of the secreted 
protein is prepared and purified to render it substantially free of natural contaminants. 
Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
about 56°C), and supplemented with about 10 g/l of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 µg/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells 
obtained through such a selection are then assayed to identify clones which secrete 
antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can be 
produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
whose ability to bind to the protein-specific antibody can be blocked by the polypeptide. 
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and 
can be used to immunize an animal to induce formation of further protein-specific 
antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies 
of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
described above. Methods for producing chimeric antibodies are known in the art. 
(See, for review, Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; 
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO 
8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Nature 314:268 
(1985).) 

Example 11: Production Of Secreted Protein For High-Throughput 
Screening Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(1 mg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a 
working solution of 50ug/ml. Add 200 ul of this solution to each well (24 well plates) 
and incubate at RT for 20 minutes. Be sure to distribute the solution over each well 
(note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off 
the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered Saline). The 
PBS should remain in the well until just prior to plating the cells and plates may be 
poly-lysine coated in advance for up to two weeks. 
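
The dilution arithmetic in the coating step above can be checked with a short sketch. The function and variable names below are illustrative only and are not part of the protocol; the numbers (1 mg/ml stock, 1:20 dilution, 200 ul per well, 24 wells per plate) are taken from the text.

```python
# Illustrative check of the Poly-D-Lysine coating arithmetic.
# All names are hypothetical; the numeric values come from the protocol text.

def working_concentration(stock_ug_per_ml: float, dilution_factor: int) -> float:
    """Concentration obtained from a 1:dilution_factor dilution of the stock."""
    return stock_ug_per_ml / dilution_factor

STOCK_UG_PER_ML = 1000      # 1 mg/ml stock in PBS
DILUTION_FACTOR = 20        # diluted 1:20 in PBS
VOLUME_PER_WELL_UL = 200    # added to each well
WELLS_PER_PLATE = 24

working_ug_per_ml = working_concentration(STOCK_UG_PER_ML, DILUTION_FACTOR)
ug_per_well = working_ug_per_ml * VOLUME_PER_WELL_UL / 1000
working_ul_per_plate = WELLS_PER_PLATE * VOLUME_PER_WELL_UL
stock_ul_per_plate = working_ul_per_plate / DILUTION_FACTOR

print(f"working solution: {working_ug_per_ml} ug/ml")
print(f"Poly-D-Lysine per well: {ug_per_well} ug")
print(f"working solution per plate: {working_ul_per_plate} ul "
      f"({stock_ul_per_plate} ul of stock)")
```

The sketch ignores dead volume in the reservoir and pipette tips, so in practice somewhat more working solution than the computed minimum would presumably be prepared.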

Plate 293T cells (do not carry cells past P+20) at 2 x 10^5 cells/well in .5ml 
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x 
Penstrep (17-602E Biowhittaker). Let the cells grow overnight. 
The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an expression 
vector containing a polynucleotide insert, produced by the methods described in 
Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a 
multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I mixture to each well. 
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20 
minutes, use a multi-channel pipetter to add 150ul Optimem I to each well. As a 
control, one plate of vector DNA lacking an insert should be transfected with each set of 
transfections. 
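
As a rough check on the volumes in the step above, the following sketch computes how much Lipofectamine/Optimem I mixture one 96-well plate consumes and the complex volume per well. The variable names are illustrative, and the small volume of the DNA aliquot is ignored as a simplifying assumption.

```python
# Illustrative volume bookkeeping for the Lipofectamine/Optimem I master mix.
# Names are hypothetical; volumes are those stated in the protocol.

LIPOFECTAMINE_UL = 300      # per 96-well plate
OPTIMEM_UL = 5000           # 5 ml Optimem I per 96-well plate
WELLS = 96
MIX_PER_WELL_UL = 50        # mixture added to each DNA aliquot
OPTIMEM_TOPUP_UL = 150      # Optimem I added after about 20 minutes

master_mix_ul = LIPOFECTAMINE_UL + OPTIMEM_UL
mix_needed_ul = WELLS * MIX_PER_WELL_UL
surplus_ul = master_mix_ul - mix_needed_ul

# Lipofectamine delivered to each well, assuming a homogeneous mixture.
lipofectamine_per_well_ul = MIX_PER_WELL_UL * LIPOFECTAMINE_UL / master_mix_ul

# Complex volume per well, ignoring the volume of the DNA aliquot itself.
complex_per_well_ul = MIX_PER_WELL_UL + OPTIMEM_TOPUP_UL

print(f"master mix: {master_mix_ul} ul, needed: {mix_needed_ul} ul, "
      f"surplus: {surplus_ul} ul")
print(f"Lipofectamine per well: {lipofectamine_per_well_ul:.2f} ul")
print(f"complex per well: {complex_per_well_ul} ul")
```

The computed complex volume per well matches the 200ul of DNA/Lipofectamine/Optimem I complex that is transferred to the cells in the next step.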

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates of 
cells, and then person B rinses each well with .5-1ml PBS. Person A then aspirates off 
PBS rinse, and person B, using a 12-channel pipetter with tips on every other channel, 
adds the 200ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to 
the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1% BSA in DMEM 
with 1x penstrep, or CHO-5 media (see below) with 2mM glutamine and 1x penstrep. 
(BSA (81-068-3 Bayer) 100gm dissolved in 1L DMEM for a 10% BSA stock 
solution). Filter the media and collect 50 ul for endotoxin assay in 15ml polystyrene 
conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of 
the incubation period. Person A aspirates off the transfection media, while person B 
adds 1.5ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1% BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supernatants from 
each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates either from the polypeptide 
directly (e.g., as a secreted protein) or from the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supernatant characterized by an 
activity in a particular assay. 



WO 98/39448                                                     PCT/US98/04493 



HGS-CHO-5 medium formulation: 

Inorganic Salts 
    CaCl2 (anhyd)                           116.6 mg/L 
    CuSO4-5H2O                              0.00130 
    Fe(NO3)3-9H2O                           0.050 
    FeSO4-7H2O                              0.417 
    KCl                                     311.80 
    MgCl2                                   28.64 
    MgSO4                                   48.84 
    NaCl                                    6995.50 
    NaHCO3                                  2400.0 
    NaH2PO4-H2O                             62.50 
    Na2HPO4                                 71.02 
    ZnSO4-7H2O                              .4320 

Lipids 
    Arachidonic Acid                        .002 mg/L 
    Cholesterol                             1.022 
    DL-alpha-Tocopherol-Acetate             .070 
    Linoleic Acid                           0.0520 
    Linolenic Acid                          0.010 
    Myristic Acid                           0.010 
    Oleic Acid                              0.010 
    Palmitric Acid                          0.010 
    Palmitic Acid                           0.010 
    Pluronic F-68                           100 
    Stearic Acid                            0.010 
    Tween 80                                2.20 

Carbon Source 
    D-Glucose                               4551 mg/L 

Amino Acids 
    L-Alanine                               130.85 mg/ml 
    L-Arginine-HCL                          147.50 
    L-Asparagine-H2O                        7.50 
    L-Aspartic Acid                         6.65 
    L-Cystine-2HCL-H2O                      29.56 
    L-Cystine-2HCL                          31.29 
    L-Glutamic Acid                         7.35 
    L-Glutamine                             365.0 
    Glycine                                 18.75 
    L-Histidine-HCL-H2O                     52.48 
    L-Isoleucine                            106.97 
    L-Leucine                               111.45 
    L-Lysine HCL                            163.75 
    L-Methionine                            32.34 
    L-Phenylalanine                         68.48 
    L-Proline                               40.0 
    L-Serine                                26.25 
    L-Threonine                             101.05 
    L-Tryptophan                            19.22 
    L-Tyrosine-2Na-2H2O                     91.79 
    L-Valine                                99.65 

Vitamins 
    Biotin                                  0.0035 mg/L 
    D-Ca Pantothenate                       3.24 
    Choline Chloride                        11.78 
    Folic Acid                              4.65 
    i-Inositol                              15.60 
    Niacinamide                             3.02 
    Pyridoxal HCL                           3.00 
    Pyridoxine HCL                          0.031 
    Riboflavin                              0.319 
    Thiamine HCL                            3.17 
    Thymidine                               0.365 
    Vitamin B12                             0.680 

Other Components 
    HEPES Buffer                            25 mM 
    Na Hypoxanthine                         2.39 mg/L 
    Lipoic Acid                             0.105 
    Sodium Putrescine-2HCL                  0.081 
    Sodium Pyruvate                         55.0 
    Sodium Selenite                         0.0067 
    Ethanolamine                            20uM 
    Ferric Citrate                          0.122 
    Methyl-B-Cyclodextrin complexed with 
      Linoleic Acid                         41.70 
    Methyl-B-Cyclodextrin complexed with 
      Oleic Acid                            33.33 
    Methyl-B-Cyclodextrin complexed with 
      Retinal Acetate                       10 

Adjust osmolarity to 327 mOsm 
Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs 
pathway bind to gamma activation site ("GAS") elements or interferon-sensitive 
responsive element ("ISRE"), located in the promoter of many genes. The binding of a 
protein to these elements alters the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Stat1 and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I cells after treatment with 
IL-12. Stat5 was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jak1, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-6, IL-7, IL-9, IL-11, IL- 
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and 
(b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a 
conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 
WSXWS motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID 
NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire process 
is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines are 
known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS 
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 
identified. 
Ligand                  tyk2   Jak1   Jak2   Jak3   STATS    GAS (elements) or ISRE 

IFN family 
IFN-a/B                 +      +      -      -      1,2,3    ISRE 
IFN-g                          +      +      -      1        GAS (IRF1>Lys6>IFP) 
Il-10                   +      ?      ?      -      1,3 

gp130 family 
IL-6 (Pleiotropic)      +      +      +      ?      1,3      GAS (IRF1>Lys6>IFP) 
Il-11 (Pleiotropic)     ?      +      ?      ?      1,3 
OnM (Pleiotropic)       ?      +      +      ?      1,3 
LIF (Pleiotropic)       ?      +      +      ?      1,3 
CNTF (Pleiotropic)      -/+    +      +      ?      1,3 
G-CSF (Pleiotropic)     ?      +      ?      ?      1,3 
IL-12 (Pleiotropic)     +      -      +      +      1,3 

g-C family 
IL-2 (lymphocytes)      -      +      -      +      1,3,5    GAS 
IL-4 (lymph/myeloid)    -      +      -      +      6        GAS (IRF1 = IFP >>Ly6)(IgH) 
IL-7 (lymphocytes)      -      +      -      +      5        GAS 
IL-9 (lymphocytes)      -      +      -      +      5        GAS 
IL-13 (lymphocyte)      -      +      ?      ?      6        GAS 
IL-15                   ?      +      ?      +      5        GAS 

gp140 family 
IL-3 (myeloid)          -      -      +      -      5        GAS (IRF1>IFP>>Ly6) 
IL-5 (myeloid)          -      -      +      -      5        GAS 
GM-CSF (myeloid)        -      -      +      -      5        GAS 

Growth hormone family 
GH                      ?      -      +      -      5 
PRL                     ?      +/-    +      -      1,3,5 
EPO                     ?      -      +      -      5        GAS (B-CAS>IRF1 = IFP>>Ly6) 

Receptor Tyrosine Kinases 
EGF                     ?      +      +      -      1,3      GAS (IRF1) 
PDGF                    ?      +      +      -      1,3 
CSF-1                   ?      +      +      -      1,3      GAS (not IRF1) 
To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies 
of the GAS binding site found in the IRF1 promoter and previously demonstrated to 
bind STATs upon induction with a range of cytokines (Rothman et al., Immunity 
1:457-468 (1994)), although other GAS or ISRE elements can be used instead. The 5' 
primer also contains 18bp of sequence complementary to the SV40 early promoter 
sequence and is flanked with an XhoI site. The sequence of the 5' primer is: 

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG 
AAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3) 

The downstream primer is complementary to the SV40 promoter and is flanked 
with a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID 
NO:4) 
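
The layout of the 5' primer described above (an XhoI site, four tandem GAS copies, and an 18bp SV40-complementary tail) can be illustrated with a short sketch. The decomposition into parts, the spacer bases between the GAS copies, and all variable names are assumptions based on one reading of SEQ ID NO:3 from this text; the sequence listing itself remains the authority.

```python
# Illustrative decomposition of the 5' GAS primer (SEQ ID NO:3).
# The split into clamp / site / repeats / tail is an assumed reading of the
# printed primer, not an authoritative annotation.

CLAMP = "GCGC"                      # 5' bases preceding the restriction site
XHOI_SITE = "CTCGAG"                # XhoI recognition sequence
GAS_SITE = "ATTTCCCCGAAAT"          # GAS binding site repeated in the primer
SPACERS = ["CTAG", "G", "G"]        # assumed bases between the four copies
SV40_TAIL = "ATCTGCCATCTCAATTAG"    # region complementary to the SV40 promoter

repeats = GAS_SITE
for spacer in SPACERS:
    repeats += spacer + GAS_SITE

primer = CLAMP + XHOI_SITE + repeats + SV40_TAIL

gas_copies = primer.count(GAS_SITE)
xhoi_position = primer.find(XHOI_SITE)

print(primer)
print(f"length: {len(primer)} nt")
print(f"GAS copies: {gas_copies}")
print(f"XhoI site starts at index {xhoi_position}")
print(f"SV40 tail length: {len(SV40_TAIL)} bp")
```

A check of this kind is useful mainly for catching transcription errors before a primer is ordered; it does not evaluate melting temperature or secondary structure.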

PCR amplification is performed using the SV40 promoter template present in 
the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with XhoI/HindIII and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

5':CreeAQATrrcOCGGAAATGTAGATrrCCCGGAAATGATrrcCCCGAAATG 

20 ATTrGGGGGAAATATCrGGGATCrGAATTAGTCAGGAAGGATAGTCGGGGGG 

GTAACrCGGGGCATCGGGGCGCTAACrGGGGGGAGITGCGGGCATrcrGGGG 

GGGATGGGTGACrAATmrnTATrrATGCAGAGGGGGAGGGCGGCTGGGG 

CrCTGAGCTATrGGAGAAGTAGTGAGGAGGCnTnTGGAGGGCrAGGCnT 
TGCAAAMQ£IT:3' (SEQIDNO:5) 
With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be used instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can 
be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using HindIII and 
XhoI, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
neomycin resistance gene, and therefore, is not preferred for mammalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
SalI and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NF-KB and EGR promoter sequences is described in Examples 15 and 16. 
However, many other promoters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, IL- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 13: High-Throughput Screening Assay for T-cell Activity 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may cause T-cells to proliferate or 
differentiate. T-cell activity is assessed using the GAS/SEAP/Neo construct produced 
in Example 12. Thus, factors that increase SEAP activity indicate the ability to activate 
the Jaks-STATS signal transduction pathway. The T-cells used in this assay are Jurkat 
T-cells (ATCC Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. 
CRL-1552) and Molt-4 cells (ATCC Accession No. CRL-1582) can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 
SEAP/neo vector using DMRIE-C (Life Technologies) (transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and transfectants resistant to 1 mg/ml genticin are selected. Resistant 
colonies are expanded and then tested for their response to increasing concentrations of 
interferon gamma. The dose response of a selected clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI 
+ 10% serum with 1% Pen-Strep. Combine 2.5 mls of OPTI-MEM (Life Technologies) 
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMRIE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10^7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10^7 cells/ml. Then add 1ml of 1 x 10^7 cells in OPTI-MEM to T25 flask 
and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with supernatants 
containing a polypeptide as produced by the protocol described in Example 11. 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supernatants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul 
of cells into each well (therefore adding 100,000 cells per well). 
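
The seeding numbers above are consistent with one another, as the following sketch shows. The function name and constants are illustrative; the density, well volume, and plate format are those given in the text.

```python
# Illustrative check of the Jurkat seeding arithmetic.
# Names are hypothetical; values come from the protocol text.

def cells_in_volume(density_per_ml: int, volume_ul: int) -> int:
    """Number of cells contained in volume_ul at the given density."""
    return density_per_ml * volume_ul // 1000

DENSITY_PER_ML = 500_000    # resuspension density
VOLUME_PER_WELL_UL = 200    # dispensed into each well
WELLS_PER_PLATE = 96

cells_per_well = cells_in_volume(DENSITY_PER_ML, VOLUME_PER_WELL_UL)
cells_per_plate = cells_per_well * WELLS_PER_PLATE
cells_for_ten_plates = 10 * cells_per_plate

print(f"cells per well: {cells_per_well:,}")
print(f"cells per 96 well plate: {cells_per_plate:,}")
print(f"cells for 10 plates: {cells_for_ten_plates:,}")
```

The computed requirement of 9.6 million cells per plate agrees with the figure of approximately 10 million cells stated above, which leaves a small margin for reservoir dead volume.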

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) 
is added to wells H9, H10, and H11 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are placed in 
an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
from each well are then transferred to an opaque 96 well plate using a 12 channel 
pipette. The opaque plates should be covered (using cellophane covers) and stored at 
-20°C until SEAP assays are performed according to Example 17. The plates 
containing the remaining treated cells are placed at 4°C and serve as a source of material 
for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 
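
Fold induction, as used above for the positive control, is the ratio of the reporter signal in a treated well to the signal in an untreated well. The sketch below shows one way this might be computed. The readings are hypothetical values in arbitrary units chosen only for illustration, and the function name and the use of the 30 fold figure as a reference are assumptions, not part of the assay protocol.

```python
# Illustrative fold-induction calculation for a reporter assay.
# All readings below are hypothetical (arbitrary units), not measured data.

def fold_induction(treated_signal: float, untreated_signal: float) -> float:
    """Ratio of treated to untreated reporter signal."""
    if untreated_signal <= 0:
        raise ValueError("untreated signal must be positive")
    return treated_signal / untreated_signal

UNTREATED_READING = 120.0          # hypothetical baseline well
POSITIVE_CONTROL_READING = 4200.0  # hypothetical interferon gamma well
REFERENCE_FOLD = 30.0              # "over 30 fold" figure quoted in the text

control_fold = fold_induction(POSITIVE_CONTROL_READING, UNTREATED_READING)
control_as_expected = control_fold > REFERENCE_FOLD

print(f"positive control fold induction: {control_fold:.1f}")
print(f"exceeds reference of {REFERENCE_FOLD:.0f} fold: {control_as_expected}")
```

The same ratio can be applied to each supernatant-treated well, with the positive control serving as a check that the reporter cells responded in that run.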
Example 14: High-Throughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate myeloid cells. 
Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in 
Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the 
Jaks-STATS signal transduction pathway. The myeloid cell used in this assay is U937, 
a pre-monocyte cell line, although TF-1, HL60, or KG1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced 
in Example 12, a DEAE-Dextran method (Kharbanda et al., 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2x10e7 U937 cells and wash with 
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 
0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KCl, 375 uM Na2HPO4.7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37°C 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for a couple of passages. 

These cells are tested by harvesting 1x10^8 cells (this is enough for ten 96-well 
plate assays) and washing with PBS. Suspend the cells in 200 ml of the above described 
growth medium, with a final density of 5x10^5 cells/ml. Plate 200 ul cells per well in the 
96-well plate (or 1x10^5 cells/well). 
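
The statement that one harvest is enough for ten 96-well plates can be verified from the quantities given. The sketch below assumes the harvest is 1x10^8 cells resuspended at 5x10^5 cells/ml, as read from this passage; the variable names are illustrative.

```python
# Illustrative check of how many 96-well plates one U937 harvest can fill.
# Names are hypothetical; quantities are as read from the protocol text.

TOTAL_CELLS = 100_000_000   # 1x10^8 cells harvested
DENSITY_PER_ML = 500_000    # 5x10^5 cells/ml final density
VOLUME_PER_WELL_UL = 200
WELLS_PER_PLATE = 96

suspension_ml = TOTAL_CELLS // DENSITY_PER_ML
cells_per_well = DENSITY_PER_ML * VOLUME_PER_WELL_UL // 1000
volume_per_plate_ul = WELLS_PER_PLATE * VOLUME_PER_WELL_UL
full_plates = suspension_ml * 1000 // volume_per_plate_ul
leftover_ul = suspension_ml * 1000 - full_plates * volume_per_plate_ul

print(f"suspension volume: {suspension_ml} ml")
print(f"cells per well: {cells_per_well:,}")
print(f"full 96-well plates: {full_plates} (leftover {leftover_ul} ul)")
```

The leftover volume of a few milliliters is the only allowance for pipetting loss, so a slightly larger harvest may be prudent when exactly ten plates are needed.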

Add 50 ul of the supernatant prepared by the protocol described in Example 11. 
Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma 
can be used which is known to activate U937 cells. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 
Example 15: High-Throughput Screening Assay Identifying Neuronal 
Activity 

When cells undergo differentiation and proliferation, a group of genes is 
activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene 1), is induced in various tissues and cell types upon 
activation. The promoter of EGR1 is responsible for such induction. Using the EGR1 
promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC12 
cell lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGR1 gene expression is activated during this treatment. Thus, by stably transfecting 
PC12 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PC12 cells can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. 
The EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using the following primers: 



5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG-3' (SEQ ID NO:6) 
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 



Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using restriction enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two mls of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ul per well of the 96-well plate, and 
allowed to air dry for 2 hr. 

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is done 
every three to four days. Cells are removed from the plates by scraping and 
resuspended by pipetting up and down more than 15 times. 
Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for a couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 
the cell number and add more low serum medium to reach final cell density as 5x10^5 
cells/ml. 

Add 200 ul of the cell suspension to each well of a 96-well plate (equivalent to
1x10^5 cells/well). Add 50 ul supernatant produced by Example 11 and incubate at 37°C
for 48 to 72 hr. As a positive control, a growth factor known to activate PC12 cells
through EGR can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold
induction of SEAP is typically seen in the positive control wells. Assay the
supernatant for SEAP according to Example 17.
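The seeding arithmetic above (dilute to 5x10^5 cells/ml, then plate 200 ul per well) can be sketched as follows. This is only an illustrative helper, not part of the described protocol; the function names and the example count of 4x10^6 cells/ml are assumptions.

```python
def dilution_plan(counted_density_per_ml, current_volume_ml, target_density_per_ml=5e5):
    """Return (final_volume_ml, medium_to_add_ml) needed to reach the target density.

    counted_density_per_ml is the density measured after resuspending the
    scraped cells (hypothetical input; the protocol only says "count").
    """
    if counted_density_per_ml < target_density_per_ml:
        raise ValueError("suspension is already below the target density")
    total_cells = counted_density_per_ml * current_volume_ml
    final_volume_ml = total_cells / target_density_per_ml
    return final_volume_ml, final_volume_ml - current_volume_ml


def cells_per_well(density_per_ml, well_volume_ul):
    """Number of cells delivered when well_volume_ul of suspension is dispensed."""
    return density_per_ml * well_volume_ul / 1000.0


# Example: cells counted at 4x10^6 cells/ml in the 2 ml resuspension volume.
final_ml, add_ml = dilution_plan(4e6, 2.0)
print(final_ml, add_ml)            # final volume and low serum medium to add
print(cells_per_well(5e5, 200))    # cells per well at 200 ul/well
```

With the target density and well volume stated in the text, the second call reproduces the 1x10^5 cells/well figure.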


Example 16: High-Throughput Screening Assay for T-cell Activity

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety
of agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40,
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by
expression of certain viral gene products. As a transcription factor, NF-kB regulates
the expression of genes involved in immune cell activation, control of apoptosis
(NF-kB appears to shield cells from apoptosis), B and T-cell development, anti-viral and
antimicrobial responses, and multiple stress responses.

In non-stimulated conditions, NF-kB is retained in the cytoplasm with I-kB
(Inhibitor kB). However, upon stimulation, I-kB is phosphorylated and degraded,
causing NF-kB to shuttle to the nucleus, thereby activating transcription of target
genes. Target genes activated by NF-kB include IL-2, IL-6, GM-CSF, ICAM-1 and
class I MHC.

Due to its central role and ability to respond to a range of stimuli, reporter
constructs utilizing the NF-kB promoter element are used to screen the supernatants
produced in Example 11. Activators or inhibitors of NF-kB would be useful in treating
diseases. For example, inhibitors of NF-kB could be used to treat those diseases
related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis.

To construct a vector containing the NF-kB promoter element, a PCR based
strategy is employed. The upstream primer contains four tandem copies of the NF-kB
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary
to the 5' end of the SV40 early promoter sequence, and is flanked with an XhoI site:
5':GCGGCCTCGAGGGGACTTTCCCGGGGACTT...
TTTCCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:9)

The downstream primer is complementary to the 3' end of the SV40 promoter
and is flanked with a HindIII site:

5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4)

PCR amplification is performed using the SV40 promoter template present in
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is
digested with XhoI and HindIII and subcloned into BLSK2- (Stratagene).
Sequencing with the T7 and T3 primers confirms the insert contains the following
sequence:

5':CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGG...
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCA
TCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACT
AATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCT
CAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGG...:3' (SEQ ID NO:10)

Next, replace the SV40 minimal promoter element present in the pSEAP2-promoter
plasmid (Clontech) with this NF-kB/SV40 fragment using XhoI and HindIII.
However, this vector does not contain a neomycin resistance gene, and therefore, is not
preferred for mammalian expression systems.

In order to generate stable mammalian cell lines, the NF-kB/SV40/SEAP
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes SalI
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the
NF-kB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP
gene, after restricting pGFP-1 with SalI and NotI.

Once the NF-kB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are
created and maintained according to the protocol described in Example 13. Similarly,
the method for assaying supernatants with these stable Jurkat T-cells is also described
in Example 13. As a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to
wells H9, H10, and H11, with a 5-10 fold activation typically observed.

Example 17: Assay for SEAP Activity

As a reporter molecule for the assays described in Examples 13-16, SEAP
activity is assayed using the Tropix Phospho-light Kit (Cat. BP^) according to the
following general procedure. The Tropix Phospho-light Kit supplies the Dilution,
Assay, and Reaction Buffers used below.

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a
plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid uneven
heating.

Cool the samples to room temperature for 15 minutes. Empty the dispenser and
prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room
temperature for 5 min. Empty the dispenser and prime with the Reaction Buffer (see the
table below). Add 50 ul Reaction Buffer and incubate at room temperature for 20
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it
takes about 10 minutes to read 5 plates on the luminometer, one should treat 5 plates at
a time and start the second set 10 minutes later.

Read the relative light units in the luminometer. Set H12 as blank, and print the
results. An increase in chemiluminescence indicates reporter activity.
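As a hedged illustration of how such readings might be compared, the sketch below subtracts the blank well and expresses a sample as fold induction over an untreated control well. The use of an untreated control well and the example readings are assumptions for illustration; the text specifies only that H12 is the blank.

```python
def fold_induction(sample_rlu, control_rlu, blank_rlu):
    """Blank-subtracted sample signal divided by blank-subtracted control signal.

    control_rlu is the reading of an untreated well (an assumption; the
    protocol itself only names H12 as the blank).
    """
    control = control_rlu - blank_rlu
    if control <= 0:
        raise ValueError("control signal must exceed the blank")
    return (sample_rlu - blank_rlu) / control


# Hypothetical relative light unit readings, not measured data.
print(fold_induction(sample_rlu=5100, control_rlu=200, blank_rlu=100))
```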

Reaction Buffer Formulation: 

# of plates    Rxn buffer diluent (ml)    CSPD (ml)

10             60                         3
11             65                         3.25
12             70                         3.5
13             75                         3.75
14             80                         4
15             85                         4.25
16             90                         4.5
17             95                         4.75
18             100                        5
19             105                        5.25
20             110                        5.5
21             115                        5.75
22             120                        6
23             125                        6.25
24             130                        6.5
25             135                        6.75
26             140                        7
27             145                        7.25
28             150                        7.5
29             155                        7.75
30             160                        8
31             165                        8.25
32             170                        8.5
33             175                        8.75
34             180                        9
35             185                        9.25
36             190                        9.5
37             195                        9.75
38             200                        10
39             205                        10.25
40             210                        10.5
41             215                        10.75
42             220                        11
43             225                        11.25
44             230                        11.5
45             235                        11.75
46             240                        12
47             245                        12.25
48             250                        12.5
49             255                        12.75
50             260                        13
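The tabulated volumes follow a simple pattern: the diluent volume rises by 5 ml per plate from 60 ml at 10 plates, and the CSPD volume is one twentieth of the diluent volume. The sketch below encodes that observed pattern; the formula is inferred from the table rather than stated in the text, and the function name is hypothetical.

```python
def reaction_buffer_volumes(n_plates):
    """Return (diluent_ml, cspd_ml) for the given number of plates.

    The relationship is inferred from the Reaction Buffer Formulation table
    and is only checked against the tabulated range of 10 to 50 plates.
    """
    if not 10 <= n_plates <= 50:
        raise ValueError("the table covers 10 to 50 plates only")
    diluent_ml = 5 * n_plates + 10
    cspd_ml = diluent_ml / 20
    return diluent_ml, cspd_ml


for plates in (10, 11, 25, 50):
    print(plates, reaction_buffer_volumes(plates))
```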
Example 18: High-Throughput Screening Assay Identifying Changes in
Small Molecule Concentration and Membrane Permeability

Binding of a ligand to a receptor is known to alter intracellular levels of small
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane
potential. These alterations can be measured in an assay to identify supernatants which
bind to receptors of a particular cell. Although the following protocol describes an
assay for calcium, this protocol can easily be modified to detect changes in potassium,
sodium, pH, membrane potential, or any other small molecule which is detectable by a
fluorescent probe.

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to
measure changes in fluorescent molecules (Molecular Probes) that bind small
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used
instead of the calcium fluorescent molecule, fluo-3, used here.

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star black
96-well plate with clear bottom. The plate is incubated in a CO2 incubator for 20 hours.
The adherent cells are washed two times in a Biotek washer with 200 ul of HBSS
(Hank's Balanced Salt Solution), leaving 100 ul of buffer after the final wash.

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To
load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is
incubated at 37°C in a CO2 incubator for 60 min. The plate is washed four times in the
Biotek washer with HBSS, leaving 100 ul of buffer.
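The dye concentration the cells actually see follows from a simple dilution calculation: 50 ul of 12 ug/ml fluo-3 added to the 100 ul of buffer left in the well. A minimal sketch, assuming ideal mixing and additive volumes (the function name is hypothetical):

```python
def final_concentration(stock_conc, added_volume, starting_volume):
    """Concentration after adding added_volume of stock to starting_volume of buffer.

    Volumes must share one unit; the result is in the unit of stock_conc.
    """
    return stock_conc * added_volume / (added_volume + starting_volume)


# Adherent cells: 50 ul of 12 ug/ml fluo-3 into 100 ul of HBSS.
print(final_concentration(12, 50, 100))

# Suspension loading: 4 ul of 1 mg/ml (1000 ug/ml) stock per 1000 ul of cells.
print(final_concentration(1000, 4, 1000))
```

Both loading routes described in this example therefore arrive at roughly the same working concentration of about 4 ug/ml.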

For non-adherent cells, the cells are spun down from culture media. Cells are
re-suspended to 2-5x10* cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml
fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension.
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed twice
with HBSS, resuspended to 1x10* cells/ml, and dispensed into a microplate, 100
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once
in a Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume.

For a non-cell based assay, each well contains a fluorescent molecule, such as
fluo-3. The supernatant is added to the well, and a change in fluorescence is detected.

To measure the fluorescence of intracellular calcium, the FLIPR is set for the
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and
(6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular
signaling event which has resulted in an increase in the intracellular Ca2+
concentration.

Example 19: High-Throughput Screening Assay Identifying Tyrosine
Kinase Activity

The Protein Tyrosine Kinases (PTK) represent a diverse group of
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase
(RPTK) group are receptors for a range of mitogenic and metabolic growth factors
including the PDGF, FGF, EGF, NGF, HGF and insulin receptor subfamilies. In
addition there is a large family of RPTKs for which the corresponding ligand is
unknown. Ligands for RPTKs include mainly secreted small proteins, but also
membrane-bound and extracellular matrix proteins.

Activation of RPTK by ligands involves ligand-mediated receptor dimerization,
resulting in transphosphorylation of the receptor subunits and activation of the
cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor
associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members
of which mediate signal transduction triggered by the cytokine superfamily of receptors
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin).

Because of the wide range of known factors capable of stimulating tyrosine
kinase activity, the identification of novel human secreted proteins capable of activating
tyrosine kinase signal transduction pathways is of interest. Therefore, the following
protocol is designed to identify those novel human secreted proteins capable of
activating the tyrosine kinase signal transduction pathways.

Seed target cells (e.g., primary keratinocytes) at a density of approximately
25,000 cells per well in 96-well Loprodyne Silent Screen Plates purchased from
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr
with 100 ul of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine
(50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO), or
10% Matrigel purchased from Becton Dickinson (Bedford, MA), or calf serum, rinsed
with PBS and stored at 4°C. Cell growth on these plates is assayed by seeding 5,000
cells/well in growth medium and indirect quantitation of cell number through use of
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento,
CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford, MA) are
used to cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture
plates can also be used in some proliferation experiments.

To prepare extracts, A431 cells are seeded onto the nylon membranes of
Loprodyne plates (20,000/200 ul/well) and cultured overnight in complete medium.
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20
minutes treatment with EGF (60 ng/ml) or 50 ul of the supernatant produced in Example
11, the medium is removed and 100 ul of extraction buffer (20 mM HEPES pH
7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3VO4, 2 mM Na4P2O7
and a cocktail of protease inhibitors (# 1836170) obtained from Boehringer Mannheim
(Indianapolis, IN)) is added to each well and the plate is shaken on a rotating shaker for
5 minutes at 4°C. The plate is then placed in a vacuum transfer manifold and the extract
filtered through the 0.45 um membrane bottoms of each well using house vacuum.
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum
manifold and immediately placed on ice. To obtain extracts clarified by centrifugation,
the content of each well, after detergent solubilization for 5 minutes, is removed and
centrifuged for 15 minutes at 4°C at 16,000 x g.

Test the filtered extracts for levels of tyrosine kinase activity. Although many
methods of detecting tyrosine kinase activity are known, one method is described here.

Generally, the tyrosine kinase activity of a supernatant is evaluated by
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a
biotinylated peptide). Biotinylated peptides that can be used for this purpose include
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for
a range of tyrosine kinases and are available from Boehringer Mannheim.

The tyrosine kinase reaction is set up by adding the following components in
order. First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM
ATP/50 mM MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride,
pH 7.3, 40 mM beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2,
0.5 mg/ml BSA), then 5 ul of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the
components gently and preincubate the reaction mix at 30°C for 2 min. Initiate the
reaction by adding 10 ul of the control enzyme or the filtered supernatant.
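The component list above adds up to a 50 ul reaction, which dilutes each stock accordingly. The following sketch tallies the volume and the resulting final concentrations; the stock values are taken as read from the text (5 uM peptide, 5 mM ATP, 5x buffer, 1 mM vanadate), and the data layout and function names are illustrative assumptions.

```python
# (name, volume in ul, stock concentration, unit); None marks components
# whose concentration is not tracked here.
COMPONENTS = [
    ("biotinylated peptide", 10, 5.0, "uM"),
    ("ATP", 10, 5.0, "mM"),
    ("assay buffer", 10, 5.0, "x"),
    ("sodium vanadate", 5, 1.0, "mM"),
    ("water", 5, None, None),
    ("enzyme or filtered supernatant", 10, None, None),
]


def total_volume_ul(components):
    """Sum of all component volumes in microliters."""
    return sum(volume for _, volume, _, _ in components)


def final_concentrations(components):
    """Map each tracked component to (final concentration, unit)."""
    total = total_volume_ul(components)
    return {
        name: (stock * volume / total, unit)
        for name, volume, stock, unit in components
        if stock is not None
    }


print(total_volume_ul(COMPONENTS))
for name, (conc, unit) in final_concentrations(COMPONENTS).items():
    print(f"{name}: {conc:g} {unit}")
```

Under these readings the reaction runs at 1 uM peptide, 1 mM ATP, 1x assay buffer and 0.1 mM vanadate.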

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mM
EDTA and placing the reactions on ice.

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of the reaction
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This
allows the streptavidin coated 96 well plate to associate with the biotinylated peptide.
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of
anti-phosphotyrosine antibody conjugated to horse radish peroxidase (anti-P-Tyr-POD
(0.5 u/ml)) to each well and incubate at 37°C for one hour. Wash the well as
above.

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and
incubate at room temperature for at least 5 min (up to 30 min). Measure the
absorbance of the sample at 405 nm by using an ELISA reader. The level of bound
peroxidase activity is quantitated using an ELISA reader and reflects the level of
tyrosine kinase activity.






Example 20: High-Throughput Screening Assay Identifying
Phosphorylation Activity

As a potential alternative and/or complement to the assay of protein tyrosine
kinase activity described in Example 19, an assay which detects activation
(phosphorylation) of major intracellular signal transduction intermediates can also be
used. For example, as described below one particular assay can detect tyrosine
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase,
Src, Muscle specific kinase (MuSK), IRAK, tec, and Janus, as well as any other
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by
substituting these molecules for Erk-1 or Erk-2 in the following assay.

Specifically, assay plates are made by coating the wells of a 96-well ELISA
plate with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp (RT). The plates are then
rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates
are then treated with 2 commercial monoclonal antibodies (100 ng/well) against Erk-1
and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this
step can easily be modified by substituting a monoclonal antibody detecting any of the
above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4°C
until use.

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and
cultured overnight in growth medium. The cells are then starved for 48 hr in basal
medium (DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants
obtained in Example 11 for 5-20 minutes. The cells are then solubilized and extracts
filtered directly into the assay plate.

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place
of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and
Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The
bound polyclonal antibody is then quantitated by successive incubations with
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over
background indicates phosphorylation.



Example 21: Method of Determining Alterations in a Gene
Corresponding to a Polynucleotide

RNA is isolated from entire families or individual patients presenting with a
phenotype of interest (such as a disease). cDNA is then generated from
these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is
then used as a template for PCR, employing primers surrounding regions of interest in
SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer
solutions described in Sidransky, D., et al., Science 252:706 (1991).

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies).
The intron-exon borders of selected exons are also determined and genomic PCR
products analyzed to confirm the results. PCR products harboring suspected mutations
are then cloned and sequenced to validate the results of the direct sequencing.

PCR products are cloned into T-tailed vectors as described in Holton, T.A. and
Graham, M.W., Nucleic Acids Research, 19:1156 (1991) and sequenced with T7
polymerase (United States Biochemical). Affected individuals are identified by
mutations not present in unaffected individuals.

Genomic rearrangements are also observed as a method of determining
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated
according to Example 2 are nick-translated with digoxigenindeoxy-uridine
5'-triphosphate (Boehringer Mannheim), and FISH performed as described in Johnson,
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe is
carried out using a vast excess of human cot-1 DNA for specific hybridization to the
corresponding genomic locus.

Chromosomes are counterstained with 4,6-diamino-2-phenylidole and
propidium iodide, producing a combination of C- and R-bands. Aligned images for
precise mapping are obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled charge-coupled device camera
(Photometrics, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv.
et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of
the genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated
disease.



Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a
Biological Sample

A polypeptide of the present invention can be detected in a biological sample,
and if an increased or decreased level of the polypeptide is detected, this polypeptide is
a marker for a particular phenotype. Methods of detection are numerous, and thus, it is
understood that one skilled in the art can modify the following assay to fit their
particular needs.

For example, antibody-sandwich ELISAs are used to detect soluble
polypeptides in a sample, preferably a biological sample. Wells of a microtiter plate are
coated with specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The
antibodies are either monoclonal or polyclonal and are produced by the method
described in Example 10. The wells are blocked so that non-specific binding of the
polypeptide to the well is reduced.

The coated wells are then incubated for > 2 hours at RT with a sample
containing the polypeptide. Preferably, serial dilutions of the sample should be used to
validate results. The plates are then washed three times with deionized or distilled water
to remove unbound polypeptide.

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a
concentration of 25^ ng, is added and incubated for 2 hours at room temperature.
The plates are again washed three times with deionized or distilled water to remove
unbound conjugate.

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution to each well and incubate 1 hour at room
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard
curve, using serial dilutions of a control sample, and plot polypeptide concentration on
the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear scale).
Interpolate the concentration of the polypeptide in the sample using the standard curve.
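The interpolation step can be sketched as piecewise-linear interpolation between neighbouring standards, with concentration handled on a log10 scale and signal on a linear scale, mirroring the axes described above. This is one simple approach among several (a fitted four-parameter curve is another); the standards shown are made-up values, and the sketch assumes the signal rises monotonically with concentration.

```python
import math


def interpolate_concentration(standards, signal):
    """Estimate concentration for a measured signal from (concentration, signal) standards.

    Interpolates linearly in signal against log10(concentration) between the
    two standards that bracket the measured signal.
    """
    points = sorted((s, math.log10(c)) for c, s in standards)
    for (s0, log_c0), (s1, log_c1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return 10 ** log_c0
            fraction = (signal - s0) / (s1 - s0)
            return 10 ** (log_c0 + fraction * (log_c1 - log_c0))
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (concentration, absorbance or fluorescence reading).
standards = [(1, 0.1), (10, 0.5), (100, 0.9)]
print(interpolate_concentration(standards, 0.5))
print(interpolate_concentration(standards, 0.7))
```

Signals outside the range of the standards raise an error rather than being extrapolated, which matches the usual advice to re-run such samples at a different dilution.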

Example 23: Formulating a Polypeptide

The secreted polypeptide composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the
individual patient (especially the side effects of treatment with the secreted polypeptide
alone), the site of delivery, the method of administration, the scheduling of
administration, and other factors known to practitioners. The "effective amount" for
purposes herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 ug/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and
most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If
given continuously, the secreted polypeptide is typically administered at a dose rate of
about 1 ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections per day or by
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous
bag solution may also be employed. The length of treatment needed to observe changes
and the interval following treatment for responses to occur appears to vary depending
on the desired effect.

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes
of administration which include intravenous, intramuscular, intraperitoneal, intrasternal,
subcutaneous and intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semi-permeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481),
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al.,
Biopolymers 22:547-556 (1983)), poly(2-hydroxyethyl methacrylate) (R. Langer et
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech.
12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D-(-)-3-hydroxybutyric
acid (EP 133,988). Sustained-release compositions also include liposomally entrapped
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods
known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692
(1985); Hwang et al., Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322;
EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008;
U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content
is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted
for the optimal secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the formulation. For example, the
formulation preferably does not include oxidizing agents and other compounds that are
known to be deleterious to polypeptides.

Generally, the formulations are prepared by contacting the polypeptide
uniformly and intimately with liquid carriers or finely divided solid carriers or both.
Then, if necessary, the product is shaped into the desired formulation. Preferably the
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood
of the recipient. Examples of such carrier vehicles include water, saline, Ringer's
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl
oleate are also useful herein, as well as liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at
the dosages and concentrations employed, and include buffers such as phosphate,
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as
ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g.,
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids,
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides,
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose,
mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates,
poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of
about 3 to 8. It will be understood that the use of certain of the foregoing excipients,
carriers, or stabilizers will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile.
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g.,
0.2 micron membranes). Therapeutic polypeptide compositions generally are placed
into a container having a sterile access port, for example, an intravenous solution bag or
vial having a stopper pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or multi-dose containers, for
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials
are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the
lyophilized polypeptide using bacteriostatic Water-for-Injection.

The invention also provides a pharmaceutical pack or kit comprising one or
more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Associated with such container(s) can be a notice in the
form prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration. In addition, the polypeptides of the
present invention may be employed in conjunction with other therapeutic compounds.

Example 24: Method of Treating Decreased Levels of the Polypeptide

It will be appreciated that conditions caused by a decrease in the standard or
normal expression level of a secreted protein in an individual can be treated by
administering the polypeptide of the present invention, preferably in the secreted form.
Thus, the invention also provides a method of treatment of an individual in need of an
increased level of the polypeptide comprising administering to such an individual a
pharmaceutical composition comprising an amount of the polypeptide to increase the
activity level of the polypeptide in such an individual.

For example, a patient with decreased levels of a polypeptide receives a daily
dose of 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the
polypeptide is in the secreted form. The exact details of the dosing scheme, based on
administration and formulation, are provided in Example 23.

Example 25: Method of Treating Increased Levels of the Polypeptide

Antisense technology is used to inhibit production of a polypeptide of the
present invention. This technology is one example of a method of decreasing levels of
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer.

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5,
2.0 and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period
if the treatment was well tolerated. The formulation of the antisense polynucleotide is
provided in Example 23.






Example 26: Method of Treatment Using Gene Therapy

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is
turned upside down, closed tight and left at room temperature overnight. After 24
hours at room temperature, the flask is inverted, the chunks of tissue remain fixed to
the bottom of the flask, and fresh media (e.g., Ham's F12 media, with 10% FBS,
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for
approximately one week.

At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The 
monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
HindIII and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and HindIII fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 
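
The primer design step can be illustrated with a short Python sketch. It is not the procedure of Example 1: the helper names, the 18-nucleotide annealing length and the "GCG" clamp are assumptions made for illustration. Only the EcoRI (GAATTC) and HindIII (AAGCTT) recognition sequences are standard facts. The sketch also reports internal copies of either site, since an internal site would cut the insert during cloning.

```python
ECORI = "GAATTC"    # EcoRI recognition sequence
HINDIII = "AAGCTT"  # HindIII recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(cdna, anneal_len=18, clamp="GCG"):
    """Return (5' primer, 3' primer) carrying EcoRI and HindIII sites.

    anneal_len and clamp are illustrative defaults, not values from the text.
    """
    cdna = cdna.upper()
    if len(cdna) < anneal_len:
        raise ValueError("cDNA is shorter than the annealing region")
    forward = clamp + ECORI + cdna[:anneal_len]
    reverse = clamp + HINDIII + reverse_complement(cdna[-anneal_len:])
    return forward, reverse


def internal_sites(cdna):
    """Map enzyme name to the first 0-based position of an internal site."""
    cdna = cdna.upper()
    sites = (("EcoRI", ECORI), ("HindIII", HINDIII))
    return {name: cdna.find(site) for name, site in sites if site in cdna}


if __name__ == "__main__":
    toy_cdna = "ATGGCTAGCTAGGATCCGATCGATCGTTAA"  # made-up sequence
    print(design_primers(toy_cdna, anneal_len=10))
    print(internal_sites(toy_cdna))
```

Both recognition sequences are palindromic, so scanning one strand is sufficient to find sites on either strand.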

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% 
calf serum (CS), penicillin and streptomycin. The MSV vector containing the gene is 
then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 
titer of virus is high, then virtually all fibroblasts will be infected and no selection is 
required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is being produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Human Genome Sciences, Inc. et al. 

(ii) TITLE OF INVENTION: 186 Human Secreted Proteins 

(iii) NUMBER OF SEQUENCES: 644 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: March 6, 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: A. Anders Brookes, Esq. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/DOCKET NUMBER: PS002 . PCT 

(vi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8439 

(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGGATCOGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCCACCGIGC CCAGCACCIG 60 
AATTOSACSGG TGCACCC3TCA CyrCTTCCrCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 120 
TCTCCCGGAC TCCTGAGGTC ACATGCGTGG TGGTC3GACGT AAGCCAOGAA GACCCTGAGG 180 



TCAAGTTCAA CTGGTACGTG GACGGCGTrOG AGC3TGCATAA TGCCAAGACA AAGCCGOGGG 240 

AGGAGCAGTA CAACAGCACG TACOGTGTGG TCAGCGTCCT CACCGOCCTC CACCAGGACT 300 

GGCTGAATGG CAAGGACTAC AAGTC5CAAGG TCTCCAACAA AGCCCTCCCA ACCCCCA3TO 360 

AGAAAACCAT CICCAAAGCC AAAGGGCAGC CCCGRGAACC ACAGGTOTAC ACCCIGCCCC 420 
CATCCCGGGA TGAGCTGACC AAGAACCAGG TCAGCCTGAC CTCCCTOGTC AAAGGCTTCT 480 



ATCCAAGCGA CAT0C5CCGTG GAGTQGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 540 

CCACGCCTCC CGTGCTGGAC TCCGACQC3CT CCTTCTICCT CTACAGCAAG CTCACCGIGG 600 

ACAAGAGCAG GTQGCAGCAG GOGAACC?rCT TCTCATGCTC CGTCATCCAT GAGGCTCTCC 660 

ACAACCACTA CACOCAGAAG AGCCTCTCCC TCHCTCOQGG TAAATCAGTC CGACCSGCCGC 720 
GACTCTAGAG GAT 733 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Trp Ser Xaa Trp Ser 
1 5 
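
SEQ ID NO: 2 is a five-residue motif in which the third position (Xaa) may be any amino acid. As a hedged illustration, the motif can be located in a one-letter protein sequence with a regular expression. The function name and the test sequence below are hypothetical, and the sketch treats Xaa as any single uppercase letter.

```python
import re

# Trp-Ser-Xaa-Trp-Ser in one-letter code; the lookahead lets overlapping
# occurrences be reported as well.
WSXWS = re.compile(r"(?=(WS[A-Z]WS))")


def find_wsxws(protein):
    """Return (1-based start position, matched residues) for each occurrence."""
    return [(m.start() + 1, m.group(1)) for m in WSXWS.finditer(protein.upper())]


if __name__ == "__main__":
    # Made-up sequence used only to exercise the function.
    print(find_wsxws("MKLWSEWSPP"))
```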



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCX3CCT0QAG ATITCCCCGA AATCTAGATT TCCCCXSAAAT GATITCCCCG AAATCATnc 60 
CCCGAAATAT CTGOCATCTC AATTAG 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GOQGCAAGCT TTTTGCAAAG CCTAGGC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCGAGAnr CCCOGAAATC TAGATITCCC OQAAATGATT TCCCCGAAAT GATncOXXS 60 
AAAT^PCTOG CAOCTCAATT AGTCAGCAAC CATACmro; CCCCTAACTC CGCCCMCCC 120 
GCCCCEAACT COGOOCfiCTT COGCXXrAnc TCOGCOXAT GCXTTCACTAA ITITmTAT 180 
'^ATCSCAGAG G0CGAGGCCX3 OrraSGCCTC TGAGCTA^ CAGAACTACn' GAGGAGGCPr 240 
TTTTOQAGGC CTMGCITTT CCAAAAAGCT T 271 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GOGCTCGAGG GATGACAGCG ATAGAACCCC G6 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GOGAAGCTTC GCGACTCCCC GGA3TO3CCT C 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGACTTTC CC 
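
Each entry in this listing declares a LENGTH and then gives the residues in blocks of ten, with a running position count at the end of each line. A minimal Python sketch for checking an entry against its declared length, and for flagging characters that are not IUPAC nucleotide codes, is shown below. The function names are hypothetical, and the check is an illustration rather than part of the listing format.

```python
IUPAC_NUCLEOTIDES = set("ACGTURYKMSWBDHVN")


def residue_blocks(sequence_lines):
    """Yield residue blocks, skipping purely numeric position counters."""
    for line in sequence_lines:
        for token in line.split():
            if not token.isdigit():
                yield token


def count_residues(sequence_lines):
    """Number of residue characters across all lines of one entry."""
    return sum(len(block) for block in residue_blocks(sequence_lines))


def unexpected_symbols(sequence_lines):
    """Characters that are not IUPAC nucleotide codes."""
    found = set()
    for block in residue_blocks(sequence_lines):
        found.update(ch for ch in block.upper() if ch not in IUPAC_NUCLEOTIDES)
    return found


if __name__ == "__main__":
    entry = ["GGGGACTTTC CC"]  # SEQ ID NO: 8, declared LENGTH: 12 base pairs
    print(count_residues(entry) == 12, unexpected_symbols(entry))
```

A mismatch between the counted and declared length, or a non-empty set of unexpected symbols, indicates that the entry should be checked against the original filing.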



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GOQGCCTCGA CSGGGACTITC CCGGGGACTT TCOQGGGACT TTCCGGGACT ITCCATCCTC 60 
CCATCTCAAT TAG 73 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTOGAGQGGA OTTTCCCGGG GACITPOCGG C3GACITrcCG GGACITOXIA 'TCTOCCAICT 60 
CAAOTAaKIA GCAACCATAG TCCCGCCCCT AACTCCGCCC AICCCGCCCC TAAC2XXX3CC 120 
CAGTTCOGCC CATTCTOCGC CCCATC5GCTG ACTAATTITr TrTATITATC CAGAC3C3CCGA 180 
GGCCGCCTCG GCCTCTGAGC TATTCCAGAA CJTAGTCACSGA GCSCmTTTC GAGCXOTAC3G 240 
CTTTTGCAAA AAGCTT 256 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CSGCAOSftGGT AAITTCTACC AGAAATPPCC AGAGCATTAT GTAGOTAGAA AAAAATOCAA 

GCAAGCTGTT AAAGATCTTG GAT0CCA3TA TATACTA3CT ATAGCR3AAA TCOCTAATTC 
AATCACmr TCTCnTTAT CCTCTAACCA AAAAATTOIT TAATTITCCA TCCCAAAaX?r 
TTTTAATCrr TCTPATATTTT TTAAAAATCC TmCTCCTC ATCAITOCCT TTnTOTOGr 
TGTAAATAGA CTTACTTGCA CTITCAAGAT GAGTrACTCC OTOICATCrT ACAAATAICT 
GATATGGTAA nTTCATAAC AGATGTCAGT TITCAACCAA GAATTOCnGA TITCTTrATA 

AGAAAAAAAC TGGdTCATr TCTCrTGAAAT TOCXCTITOA AAATnCTrr TTACAOTrCT 



AAGCCAACTG AGATAOOCTPG ATOGTICTPGA TITCTITCAA IGA-TCCTTAC CAlCTATITr 480 
AGCCACTCAG CCTTrrArTA TTTOTClArr TGTAAACTITr A^mCTCTTA ACTCATITAA 540 
TAAATATACT CnTnATCTGT TTCTGAAAAA AAAAAAAAAA AA 582 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTrrGC5GC5GT GAG0CCGAC3C TOCTOOSGCSG CTTamXSCC GGCCAGGACA CAGCTACTCG 
CAOGC30GC3CG GCGCCTC3GCT ATGATGTTCC TCACCCAGGG OXSGCCTCTC CCCTCTACTC 

CCTGOTGGAA TC3GACTAAAA GGACCCTTGT C3TC3GGAACAG CT3CTCCCCA AACAOCCTOC 
TQCTGGCTC5C CAGGCAGGCC CTCTC3GAAGG GAAQCX3C5CAG GACTXMtVVS G 



GACCCCroCA GC3GCAGQCAG CTTGGGCCCG AGCCCAAGCA TTTOGCICTG CTCCCCCCAA 
GC3GGACAGGA AGCCrCTTGG GCCTCTTCCC TTCdXSGACA AGC3CCCOCTC CCTTTOCCTC 
ACATAAACTG TACAGTATTT TCATTAAAAG CCTCTTTCAT AAAAA 465 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGCAAITCC TC3CTCACAGC CTTTCrGTTG GTCSCCACTTC TCGCTCTITC TCAIWTCCCC 
ATATCCCTAG GCITCTCCCC CTCCTAGAAG QGCTTCITCA TAGATTAGAA AATAAGAATC 
AGTGACA!rrT CCTATGTGCA TATAAGAAGG AGCCACAAGA CATCTCTnT AAATAAAAGG 180 
ACACTGTCCA TCCmTAGC TC3CCGAATAG AACCTTGGTC TCATCCTCCT GGAGCTAGGC 240 
CTTTAAAACA GCTTCTGTGT ITCTCATITG TCTCAGTCPT TIGCCAGGCTr TTTATCGGAA 300 



AGATAATGIT COGlTrAAAA TATTTCCTAA TGAGGCOCXSG On^^ 360 
ACCCTAGCAM 1TOQGC3GCTG AGOGQGTCGA TCAOGAC3GTC AGGAGATCGA GACOVTCCIG 420 
GSTAACATOG TCSAAACCCO^ TCTCTACTAA AAAlftCAAAA AAAAAAAAAA AAAA 474 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTATGTTOGG GAGCAAGACC TGEATAGCCAG CCTTTACATG GGAGTATAAT TCTOTCCTCC 60 
ATCTCATAAG CCCCAGTACC TGAGCCAGAA TGMTATAAC CAACCACACT GTCTCTrTAT 120 
CATGGATQGC TTTAGCAGTA GGlTATnTC ATCAITOCCA TTTGTAGCTC TACACTTOGIT 180 
TATAGTAATT TCTTCATCTTT TAAC3TCTCTC CCPCAGTC3CC TCTTCTEKTC AAACTCATTC 240 
CTCTCTCANG CAGTTGAGCT CTOCATTCTC CCYTATQGGG GAGAGCTCTC TTCGAGAGAG 300 
AGAATAINAC TTCC 314 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCATATTQC CGTCTGGCTA AAAGTGAACA TQCCATTGAT CAATCTGCTT TTATTATATT 60 

ATGTTCCTTAA TGGTC3GCAAG CAAGftCAAGA AGTAGAAAGA AAGAIGGTOT AAGCTCAAGA 120 

ACCCACTAAA TCTATCCTAT GGCCrOGGTT CACCCAGCCT GCTITCTCGA TTTTOICTCA 180 

CTATAACAGA GCTCCCAAGG AGACTGCAGA GTCfiGCTCCC TTAflGCACTC TAACTAAAGC 240 

CTAACICTTC CGnrcCACCC AACAATGTYC CCAGCTCATC CTCTITCCCR AACTTCCCCTr 300 

TCTC5CCCCAG ATGCGAATTG CATTTAACTA ATCCrCAAC?r GAAATGTCCA CACAGRA3TC 360 

CATTTTAATr AfiCATACCAT AGTmTGTG CAAATITGCT TICAGARGAC 1CCCATIGCA 420 
C5CTGCTCAGA GACGCTAAWG GCAGGGCCTC TTGAWGCTTr CCCGATAGCT TTCAGCTCCA 480 






ATAGCTCTTA GGCAGAATGC CATGAGC?GTC CTGOCCAACT GTATTACTOG GGAACACCIG 540 






APPGGCTAGA AGnTCATCCT CCTGTAACTT TTCTGAGTTC TrTACAITrA CTCGTCAAAC 600 
CCAAATATGC CAC 613 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

COCCCCOCAT TGAACCCTGG GCTGTGAAAG TTTTTOCCTG TGTC3QGTCCT TCTGKmSGC 60 

C3CCTGGTC3TG TGGKTCOCAA CTCCICTTGC AAAGTOQCAG CAGCCAATCA TGAAGCGCCC. 120 

TTAITTrTAG TTC3CAGATC3A CCAGGTCTCC CCCOCACAGC CTCTOTCTOG TCCCTCATTC 180 

GTGAGTC3GTC TQCCTGCCCA AQGAGCCTGA TTQCSTGQGAA ATOOCATCAT CTAATAOGAT 240 

QQGAAGGCAT TTQGTCCTQG TTATOTTTAT TACAACATCA TTGCACTCTC GGACTCCAOT 300 

COCTGAAAAC GTAATTICTG GTGTTACCAA AGGACCACAG GQGAAAAAAA AAAAAA 356 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAAACTftNAT CCCGGGGCTT TTAACWQGTA CTTGGGAAAT AAGTAITOC3G TAATCACTAA 60 

0«3GACATTG ACTOCACCAA ACCAAAGCTA TAGAAAGAAA TGATTGACTT TTTAAAATAT 120 

A3TCACATTA ACTGTPCCTAG GATACTTCTC TTCAGGCTTT CXSAAAAdTC TTCCITCAAA 180 

TTPGCATATC CACTCCAGTT CTGTCACCAA AGATTTTAAT CTTCAGMKX; CAATnCCTC 240 

TCTCOCAGAA AAAAGTACTA CAACAGGCTC AAGGGATATG CTTTGGTGGT CAAGGGA3TA 300 

CACTATQGTT TTCCTICICT TCACAATGGT ATTEACAC3GA GACCTTCTCA TCAGAGGACG 360 

TACTGAACTA TCTTTATCSAC TTTGGATITG ATCAGAQGIT TAAAAAAAAA AAAA 414 




(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AATCACCATT GCAATACAAA TGATCTGCCT GGTGAATGYT GAGCICTACC OCACMTCCT 60 

CAGGAACYTC GGAGTGftTGG TGTGnTCCTC CCTGTGTGAC ATAGGTOGGA TAATCACCCC 120 

CTTCATAGTC TTCAGGCTGA QGGAGGTCTG GCAAGCCTTG CCCCrcATTT TOnTCCdCT 180 

GrrOGGOCTG CITGCCGOGG GAGTGAOGCT ACPICTTCCA GAGACCAAGG OGGTCGCTTT 240 

GCCAGAGACC ATGAAGGACG CCGAGAACCT TOGGAGAAAA GCAAAGCCCA AAGAAAACAC 300 

GATTTACCTT AAGGTCCAAA CCTCAGAACC CTCGGGCACC TCAGAGAGAT GmTCCGGC 360 

GATGTOGTGT TGGAGQGATG AAGATGGAGT TATCCTCTQC AGAAA3TCCT AGACGCCTTC 420 

ACTTCTCTGT ATIX7ITCCTC ATACTTGCCT ACCOCCAAAT TAATATCAG 459 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCCCCCCCCC CCCCCACACT TTCAGGfiGTC ACCCCCCAGC ATTTOGGGTO 60 

TACTCCAGCC TKSAGCTCCC TQAGGGAGCC TGCACTCCCT GCTCCXAATC CCCGCTACTC 120 

GTGCAGGGAT GCAGCCTGGA GCTGGOGTCC TTCTTCTSGG CCTOCTGCTC CCGCCACCCC 180 

AGAGCCOCAG CCTCTCCTGA ATPGACATCA GTQCTTCCCT GAACTOCCTC CCCCACCCCT 240 

GGGCATEATC CCAGGAAACT TTATGTTTrC TA6AAGCTAA GCAGCTOCTG GQACTCAGGG 300 

ACTQCTGCAG GTAGGCTGAG TGGCAGCTCA G?rOCTAGAAG GTCTCTGAAG ATCTCGACTC 360 

AGGACCTTGC TACTCCCCAA GCCAGAGCCC ATCAGCCAGG CCTCCTOTCA GCCACCTCCC 420 

TCTGGAGTGC TGAGCTCAAC CAAAGOCTGG CAAGCTCTGG GOCTCATTTA AGGGATTCTG 480 

ATGAGCCGAT GGGCCCTGGA GGCAGCCCAT TAAAGCATCT GGCTCCTTTT TOGAAAAAAA 540 
AAAAAAAAAA 550 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCTTCAAGAG TGTftCACJEAC AGGA3TATTA TAATGAAAGT TTATATCAAC AGQOTITO3T 
TOGCTCTGCA TATATTATAA GCAAAAGAGA 1TOCTAAACT C3CCACAOTAT TCCAGATAAC 
mrCAGriTC OXXCTITCT TCTOGrPTCTT TAATPK3AAA CCTAGATACA TXSCAffTAAAA 
ACTAGGAGAA TGACmTAC OCTPC3GGGAC AGCCAAOTTT TOITCATAAA CCTAmCCT 240 
AGCATGOCTT CAGGAAGITC TGCCAGACCC TAGASTCTCA AQCSACCCACT GTTCTTCTOr 
TGTAOGAGCT CCCTGAACCA TTGTrCAGAG GACCAATCTC ACATCGCTTC ATCGGCATCG 
NCCA1GGGAG CATCTOGGTG ATAYCTCTCT ACAGTATK3G COCITCTCCG AC3GCIX5ATAC 
ACAAGQOCrrC TCTTCCACAT GATCATTTCX: AAACCTCCCC CAGCCCCTAC CATCCAATOT 
GGAAGGAAAA CAAGAACTIGC CTGAAGAAGA GTCCAAGCTA CAGATACACA CSCGTOTCCAT 540 
TC3CGGCTC3TC ACCTTCCTCC TCCCACTTCT GTATOCTCAG AGATCCTCCG TCCSMXJrTTC 
CTTAAOCTCA CSCTGACTrcC CTGTGAATGT CTAATOCTAG TTCAGCSCSCCT CCAGGCATIG 
ATITGTACAG TQGTAACTCC CAATGAGC3CT TCTOTTATCA TTTOCTK^rcc rrTYTCTOIC 
ATTAAAAGAA ATGATTTTCC C 741 




(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGCACGAGTC TOCOCTCXSGG AAGTIT^ 

AATGTGTCTA GAGTGTTQGG CAGAAAATCT GC5GACCACAC CACACCACSTT CTCTCCTTAA 
TCCAOGTCAT TTOCCTTCTA TCCCAGCTAT GTTTCCACTrc TCCKTOCT CTITCCAAGA 
GCAACAAGAA ATGAATAAAT CTCTC3C7rGAG TICTTTATIT (HTCITCACT TICTTTTACA 

CTOTATTm: TGAGrrrArc GGrGTCTGrre aattaaaaag gaaaactaga aataactaaa 



ACTCflGGTTG AAGGAAATAT ACATAAATAA GATAAAGCTG ACCTCTAGAT ATAGCAGCTT 360 

ATAAAGCTTA GAGITGTCTA ACTTQAGTCC AAATTTTCCT CTCATCITIC TCATOCCGAA 420 

CAAAAAAGCA GTCATGrrTTO TrATGTGATT GGAATtSGAAC CTGAGAAGAG AGCA3X3CTCT 480 

CSTTCTJCTGG GACAGGAAAG CTTGCCnGCA CXZAAGTCTGA ACCACCACCT TCATOTTCAC 540 

ATAGATTATG TGCTGGAACA TATITCACAC aSGCTTGQCA GTAAACACTT CTAGTOITCr 600 

GCAGTC3GAAA CQGTCATCTT COGCTAAAGC ACGGCGTC?rr GTCCAGCGGA AATOGTCATC 660 

TGCTGCTAAA ACACAGCTTC CATOffTAAOXS TATQCTOCTT ACTCAAAGAG TOTOGTCCCA 720 

AACAQCCTTT GGGAGGTOCT CCTTGATTCA TGGATGAAAC CTOGAACATC TTCAGGACTC 780 

AOrrAACCAT AGGTCCTTAA ATAACTCTCC ACACGTTrTT CTTAGrPTrAT CTCTACATOC 840 

AGGGTGTGCA GCAGCCTGTT CAAAGTCATA TITrcTGGGA AATATTICCA GTOrrTASTT 900 

CSCACTTTAGC CCACPCTGTG TAGOCTTATT TCITCTAAAC TCACCAITAA TCIX3AATAAT 960 

AGTCAAATTT AGQGQGACTG TATTrGCCTT A 991 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCACGCGTCC GGAATTCCCC TGAGGATCTT GGGCTATCTT TGACAGGQGA TTCTTOCAAG 60 
TTGATGCTTT CTACAAGTGA ATATAGTCAG TCCCCAAAGA TOGAGAGCTT GAOTTCTCAC 120 
AGAATTC3ATG AAGATGGAGA AAACACACAG ATTGAGGATA CGQAACCCAT GTCTCCAGTr 180 
CTCAATTCTA AATTTGTTCC TGCTGAAAAT GATAGTATCC TCATCAATCC AGCACAGGAT 240 
CXSTGAACTAC AACTGAGTCA GAATGATGAC AAAACAAAGG GAGATOATAC AGACACCAQG 300 
GATGACATTA GTATTTTAGC CACTGGTTGC AAGGGCAGAG AAGAAACGCT AGCAGAAGAA 360 
GTrPGTAOTG ATCTCACITC TGATTCGGGG AGTCAGGCAG TTCCGTCACC AGCTACTCGA 420 
TCTGAGGCAC TTTCTAGTGT GTTAGATCAG GAGGAAGCTA TQGAAATTAA AGAACACCAT 480 
CCAGAGGAGG GGTCITCAGG GTCTGAGGTG GAAGAAAICC CTCAGACACC TKHGAAAOT 540 
CAAGGAGAGG AACTCAAAGA AGAAAATATG GAGAGTGTTC OTITOCACCT TTCTCTCACr 600 



GAAACTCAGT CCCAAGGGTT GTGTCTTCGG AGGCATCCAA AAAAAAAAAA AAA 653 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGCAGGCTGA CGAOCTQCAA GCCACAGTC3G CTGOCCTGTG OGTGCTGCGA GGTGGGGGAC 60 
CCTGGGCAGG AAGCTOGCTG AGCCCCAAGA OCCCGGGGGC CATQGQCGGG GATCTOC3TOC 120 
TTGQCCTQGG GGCCTTGAGA OGCCC5AAAGC GCrTQCTOGA GCAQGAGAAG TCTCTRGCCG 180 
GCTQGGCACT GGTOCTOGCA SGARCTGGCA TTGGACTCAT GGTGCTGCAT GCAGAGATOC 240 
TGTQC3TTCGG GGGGTGCTCG GCTGTCAATG CCACTGQGCA CCTTTCAGAC ACACTTTOGC 300 
TGATCCCCAT CACArrCCTG ACCATCGGCT ATQGTGACGT GGTQCOGGGC ACCA3X3TOGG 360 
GCAAGATCGT YTQCCTGTGC ACTGGAGTCA TQGGTCTCTC CTCCACAGCC CTOCraTTOG 420 
CCCSTOGTOGC COGGAAGCTG GAGTTTAACA AGGCAGAGAA GCACGTOCAC AACTTCATGA 480 
TGGATATCCA GTATACCAAA GAGATGAAGG AGTCCGCTGC CCGAGTGCTA CAAGAAGCCT 540 
GGATGTTCTA CAAACATACT CGCAGGAAGG AGTCTCATGC TGCCCGCANG CATCAGCGCA 600 
ANCTGCTGGC CGCCATCAAC GCGTTCCGCC AGGTGOGGCT GAAACACCGG AAGCTCCGGG 660 
AACAAGTGAA CTCCATGGTG GACATCTCCA AGATGCACAT GATCCTGTAT GACCTOCAGC 720 
AGAATCTGAG CAGCTCACAC OQGGCCCTGG AGAAACAGAT TGACAOGCTG GCGQGGAAGC 780 
TGGATGCCCT GACTGAGCTG CTTAGCACTG CCCTQGGGCC GAGGCAGCTT CCAGAACCCA 840 
GOCAGCAGTC CAAGTAGCTG GACOCACGAG GAGGAACCAG GCTACTTTCC CCAGTACTGA 900 
GGTOGrrOGAC ATCGTCTCTG CCACTCCTGA CCCAGCCCTG AACAAAGCAC CTCAAGIGCA 960 
AGGACCAAAG GGGGCCCTGG GTTGGAGTGG GTTQGCTTGC TGATGGCTGC TGGAGGGGAC 1020 
GCTGGCTAAA GTGQGKAGGC CTTGGCCCAC CTGAGGCCCC AGCTTQGGAAC ATQGTCACCC 1080 
CCACTCTGCA TACCCTCATC AAAAACACTC TCACTATOCT GCTATQGACG ACCTCCAGCT 1140 
CTCAGTTACA AGrTGCAGGCG ACTGGAGGCA GGACTCCTQG GTCOCTOGGA AAGAGGGTAC 1200 
TAGGGGOCCG GATCCAGGAT TCTQGGAGGC TTCAGTTACC GCTQGCCGAG CTCAAGAACT 1260 
GGGTATGAGG CTGOGGOGGG GCTOGAGGTC GOGCCCCCTG GTQGGACAAC AAAGAGGACA 1320 
CCATTTTTCC AGAGCTGCAG AGAGCACCTG CTrGGCSGAGGA AGAAGTCTAA CTCACCAGCC 1380 
TCTGCTCTTA TCTTTGTAAT AAATGTTAAA GCCAGAAAAA AATAAAAAAA AAAAAAAAAA 1440 
AACTCGAGQG GGCXTCRKAC CCAATCWCCX: TATAGTAKflC CTEANNN 1486 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CrrOQCOGTT TCTCCTGCCA GGQGAGGTCC OCSGCTTCCCG TCSGAGGCTTCC GGACCAAGCC 60 

CdTCAOCTT CTCCCTCOQG ATCGATGTGC TC3COGCOGCC GCCGCCQCCG TCCCGCGTCC 120 

1TCGC3TCTCT GCTCCCGGGA CCCGGCTCCG CGCAGCCAGC CAGCATOTCG GQGATCAAGA 180 

AGCAAAAGAC GGAGAACCAG CAGAAATCCA CCAATGTAGT CTATCAGGCC CACCATOTGA 240 

GCAGGAATAA GAGAGGC3CAA GTGSGITOGAA CAAGGGGTGG CTTOCGACSGA TOTACCGICT 300 

GC3CTAACAGG TCTCrCTGC3T GCTQGGAAAA ACAACGATAA GmTCCCCT GGAQGAGTAC 360 

ITCTCTCCCA TGCCATOCCT CSTTAATTCCT GGATQGGGAC AAaXTTCCGTC ATOGCCTTAA 420 

CAGAATCCCC CAGAOXXSCTT OmXSCCCCC AAAGCATOGEA AGGTCCTCAC AGATTAITAC 480 

AGGTCCCTGC AGAAGAACTA AGCCTTTOGT CCAGAGTTTC TTTCTGAACST GCTCTPITCAT 540 

TACCITTTCT ATPTTTATGA TEAGATQCTT TCTATTAAAT TGCTTCTCAA 'TCATGCATTr 600 

TAATCmTA TAATGAAGTA AAAGTTGTGT CTATAATTAA AAAAATATAT ATATATATAC 660 

ACACACACAT ATACATACAA ACTCAAACTG AAGACCAAAT CTTAGCAOCJP AAAADCAATA 720 

TTCTTATACA TTTCATAATA AAA1TAGCTC TATGTAITrT CTACTGCACC TCAGCAGGCA 780 

GGTCCCAGAT TTCTTAAGGC TITGTrrGAC CATGrTGTCTA GTTACTIGCT GAAAAGTCAA 840 

TATATTTTCC AGCATGTCTT GACAACCTGT ACTCTTOCAA TtSTCATTTAT CA13nx?EAAA 900 

ATATATCAGA TGrOFCCTCT TCTGTACAAT TGACAAAAAA AAAAAnTTT TITTCTCACT 960 

CTAAAAGAG6 TQTGGCTCAC ATCAAGATTC TTCCTGATAT TTTACCTCAT GCTG?rACAAA 1020 

GCCTTAATGT TCTAATCATA TCTTACCTGT TQAAGACCTG ACTOGAGAAA CAAAATCTOC 1080 

AATAACGTGA ATTTTATCTT AGAGATCTGT GCAGCCTATT TCTGTCACAA AAGTTATATr 1140 

GTCTAATAAG AGAAGTCTTA ATQQCCTCTG TGAATAATGT AACTCCAGTT ACACGC3TGAC 1200 

TTTTAATAGC ATACAGTGAT TTGATGAAAG GACGTCAAAC AATOTCGCGA TOltX5TGGAA 1260 

AGTTATCTTT CCCGCTCTTT GCTGTGGTCA TICTGTCTTG CAGAAAGGAT GGCCCTCATC 1320 
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CAGCAGCAGC GCCAGCTGTA ATAAAAAATA A3TCACACTA TCAGACTAGC AAQGCACTAG 1380 

AACTOGAAAA GACCACAGAA AACAAAGAAT OCAACCCTTT CATCTTACAG GTCAACAAAC 1440 

TGTGATGATC CACATGTATG TOnTTGTAA GCTGTGAGCA OCGTAACAAA A3X5TAAArrr 1500 

GCCATTATTA GGAAGTQCTG GrrGGCAGTGA AGAAGCACCC AGGCCACITC ACTCCCAGTC 1560 

TQC3TGCCCTG TCTACACTAG ACAACACAGG AGCTQGGTCA GATTCCCCTC AGCTOCTTAA 1620 

CAAAGTTCCT CGAACAGAAA GTCCTTACAA AGCTOCCTTC TCQGATACTG AAAGGTCGAG 1680 

TITPCTGAAC TGCflCTGATT ITATTGCAGT TGAAAAAAAA AAAAACXTTAT TCCAAAGATT 1740 

TCAAGCTGTT CTGAGACATC TTCTCATQGC riTACTTCCT GAGAGGCAAT GTTTPrACrT 1800 

TATOCATAAT TGATTGTTGC CAAGGAATAA AGTCAAGAAA C^GCACCTTT TAATATATAG 1860 

GTCTCTCTGG AAGAGACCTA AATTAGAAAG AGAAAACTCT GACAATTTTC ArATTCTCAT 1920 

TCTTAAAAAA CACTAATCTT AACTAACAAA ACnTCTTTTG AGAATAAGTT ACACACAATC .1980 

GCCACAGCAG TTTOrCrTTA ATAGTATACT GCCTATACTC ATGTAATCOG TTACTCACTA 2040 

CTGCCTTTAA AAAAAAAAAC CAGCATATTT ATTGAAAACA TCAGACAGGA TTATAGTGCC 2100 

TTAACCGATA TATTTTGTGA CTTAAAAAAT ACATTTAAAA CTGCTTCTrcr GCTCTAOTAC 2160 

OmSCTTAGT GCAAATGATT ATTTCTATCT ACAACTGATG CTTCTTCTrA TTTTAATAAA 2220 

TTTATCAGAG TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 2323 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGCAOGAGCC TGTGTGGTCA TGrTTCCTCGT GGrTGCAGTAC CTGACATGAG CCAGCCACGC 
TCAGTGGCTG AACAGCATTC CCACAGCCTG CAAGTGTGTG TGTGTGTGAA AGAGAGflGGG 
GCSGCCCAGftG CCGCCTTITC AAATGTTTGC CTCTCTGAAC TGTGAAGACA CTO3GGAGTG 
ATICTOGTCT AATTPCCAAC CTGCTCTGrTT TPCTGTGACA TCTTGGAGQG GAGCTAGTOC 
CACAOCATGC GCQGTGCTTA GAAATGAAAA AGTrcCCGGGT CTGTCTCTCT CACTCO^XSCT 
CTCATQGGGG AGGGAAAGAA TGGCTTTGGT GGCTrTGTTC ACACAGCTGA TCCGTOCTOG 

GAAGGrrcrcc acagtgagcc tgtgtgcagg actgtccaca oggttcacac ttotcaccat 



CAGGCCTITC TQC3TCCTGAT AGGGTOGAGC AAAAGTGGAA AQGAAAGGAA AGAGGCTTTT 480 
COCACAGCCA TTATATTAAA TAGTAGGTrCG ATTCACATCT OGTGCTOCTG QCCACCrTCC 540 
CCTGTGCXrrC AGTCACATCT AGATGACTTGA CTOCCAATAC TTGTCACCAT TCCCK3GAAG 600 
CAGCTACCTA QGGGAAACAA GATGTAGTGC TATTGCOQAT AACAAGTAAG ATTTTCCACA 660 
CTAAAAAAAA AAAAAAAAAA AAA 683 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CTGAGAAAGG AAAGCATTCG GATCPGCTGC AAAAACACAT ATATCCATAA AGACICA3CT 60 

TATTCAGAAA ACAGAITOTG AACACAATCA CATTCGCATG AATCCTTTAA AAGGAAGAAG 120 

ACCTTAAAGT ATCTGCAAAT CTCAATTTCT ATTTATTCCT TCACTGAATA TAGAAACAAT 180 

GCTTATCTGA TTATTAGAGA TATTATTTTG GATATGTTAC 1TATTAACTT GCTAIGGCTC 240 
GTAAOCATGA TAAAGTCTOT TATTAATAAC AACATAATTC TTTTTTTAAA GAAGAAAAGC 300 

ITATTTTrCA TTGACAGTGT AEAGATTTAT CTACTTAGTT GTICTTTTGCT ATTAGTOnT 360 

TAATTTrrTT TTTAAGITGA GTGTTTGATA AATTTTAAGA OCCTGTCCCC AC Cnum ' i ' 420 

QAGTCCTGTG 1TCACTACAG GTEATATAGCY CAWTTTAAAA ATCCTAAAGC AAAAGAATTT 480 


TATTTATAAA AGAATCMAMC MGTTC3CATGC ATCAGGCTGT QAAGTCAGAT ATTTAGTAAT 540 
AAAAGCAGCA GTGCCmTP TTGrTATTTAC (XATTGACCC CCACCAAATC CAACTGinT 600 

ATATTAAGAA AATAGTAACA ATTTTAAAAT CTCAGAGTAA AATCTATTTC ACTACATOCT 660 

ITTCCXCCCT TGnrCTGATT TAAGCAGTCT C?rACTTGGCA TCTCTACATT GKXTAGGGA 720 

CAGTGGrPGTT CTACAATATT ATCATGTATG ATGITTTATT GGTOCTTTTT ATTCATArJIG 780 

GCTTCTTACC AGAAACAGTA GGAAGAAACA CATGAACTGT GTACAAGACA TCAAACMTC 840 

CTOCTGATAT GTTGTTTTTT CACATGCrTT TGACTTTTCA CTTTTTAAAC GAGAGCCAGC 900 

AAGCAAAATA GATGrPGGCTG GC3TCTGCCTG TCCGGGOGGC TYTTTGCACC GAGCTCTCAA 960 

ATCCTGTGTA TTGAGGGTTC CTTTTTGGTA CTCAGGATTG GAGCTACAGC TOGCXXXXXX! 1020 

TCTCTCCCAT TCGTTTGAAG AGACACTGAG GGAAACAAGG G lVil.Tirm AQGTCTCCrT 1080 

GGCTQCCnr TACGQGATGG GAGOCTTCTC CGWFCrTTT GTTCTTCTCC ACCTCTTCTA 1140 

GCTACTGCCX; GTCCAAQtTTT GTAGATGTTA TTCCCCAGGA GCCTGGGCTK GGGQCXnGAG 1200 

CTGGGCTGAA TOCAAAAGCA TQCAACCAGA AGQCX3C3GCAA GQGGAGGAAA AGCAGGOCTC 1260 

GCrTCAlTCG TCXrCTQGAG ATCTTCTOTAG CACTCAGCTC CAGCTTGCSGC CTOOQGAAOC 1320 

AGCCTGACCA AGCX3GCTC:AG GTGTOCCTGT TACAAGAAGA ACCTCCAGAA GGATAATTIG 1380 

CACATGGAGC TGTGATAACA CTAATCTTGA Tm ' lTmT TTTTACAAGT CATCA(3«SAT 1440 

GmTGCAAAG TCSAGTTTrAT TTTnTCTAA TrcCTTTATC TTTACTTAAA QGTOAAOGK; 1500 

TArrCCTCTG GGAQGAATAG GAAGAAAACA QGAATGTTAA TAATCTOSAA CAGAAAACTT 1560 

CCTOCCTTAT TAATATATAA TCYTCATGTA TTTATGCXOT AATGrTAAGCT GACTITTAAA 1620 

AAGCTTTCTT TTOTPGCATG CCCTGTGCAG GCATCTOPAT TGTACATCCA TCCCTTTCCTr 1680 

CCTCTTTTCC 1CTATAAAGT TAGTCAACAA AGAAATATTT TTGCCCTAGT TCATOnGCC 1740 

AAGCAATGCA TATrrrrTAA ATTTGTCATA TATQGAAAGA GCAaXTTTlCT TACATOTAAA 1800 

ACSCTTTACTG ATATACAGAT ATACTAATGT OTGAAGATCX: TOITCTITOC AAGTCTACAG 1860 

TTTTCAAATG TTGTTACCAG TGAAACACCC TTGTOGTTTA AACITOCTAC AATCTATrTA 1920 

TTATTCWrrr CCTCCCATGT AACTAAGAAT CATGGCTATA TTTCATATCA ACGTrATATT 1980 

GAAAGTGAAG GGAAATGATT AATACAAGGT ITTCTAACAA AAAAAAANAA ANNAAA 2036 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGCAOGAGAT AACATAGGCA CAATAATACT GTATGTCTAC TTCIAGGA3T ATAAGGAATT 60 

AACATTGAGA TGACATTTCC ATITGA6AAG AAAATAGTTG CmCAGTOC CTnTATTTC 120 

ATTCCTGGAG AGAGCAGACT CGCACCAACA TTCAACCCCA GCGCTGATAT GACAGTAATC 180 

CTCAGAGGCA GAGOCCAGCA CAAAACAGCA ATCXTTAGAAA GTTACAATTC GAAAGITTCC 240 

TGCCAGCrrC GGGAATGACA CIGCAAAGCT GATC3CCAGAA ACTCCCAGAG TAATTCTCCr 300 

CATTACTGCT CTAOCCACCC AdTTCAGCT CCCCAAATTA ACTAGTOCAG TTCACTAATC 360 

CTCTTTACCT TTATCATTTA GGTGAQGCAT TGCACAAAAA CTCTCGACTT TCCCATATAA 420 

GGGCTGTQGT TCPCTGTQGT CCTGGATAAG AGGCATCACC ATTATCTOGA AACATOCACST 480 






AAATOCAGAT TCTTCATCTT CTCCOCfiGAC CTCCTGAGTT AC5AAATTCAC AAGTTCTCCA 
GGTCATCTCA TACATGCTAA AGnTGftGAA CCATTCAGTA AftGTTAATGC ATTAAGAAGA 
GATTAGATAG GGATC3GTQC3C GrTATCTTCCT ACAGrPTTCCX: TGTTAACAAG AAAGTCAGAG 
GTCAGnTGAT CAGACATTAG ATTATTTATT GCTAAAACTA AAAAAAATTA AAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GAATTCGGCA C6AGCAGCAT CCTAATTTTA GTTTGGAGAT GCATTCTAAA GGATCTTCTC 

TATTOCTTTT TCTCCCACAA TTAATCTPGA TTCTGCCTGT CTGTCCACAT TTCCATCAGG 

AACTGAACTG TTGTTITCAT AGGTAAATGA GAGACTGAGT TTTTTCAnT CTCAAGAGAA 

AGGGCATTTG CTCCTACAAG CTQAAAGGCA CCCCTGGGTG GCTQGGGCCC TCGTOGGAGT 

TTCTGGGGGA TTGACCCTTA CAACATQCAG TQGCCCTACA GAAAAACCTG CAACTAAAAA 

TTAmTTTA AAAAGGCTCC TOCAGGAAAT GCATATAAGG GCTAATCACC CAGTATTITC 

ARGCTTCGAA GARGTAATAR AMCCCTOGAG AGAGAAACTG AGACATSTAA GAGGGIGGGA 

ATGACTCAGT GGTGGCACAC TATGGAGTCC TGCCCACAAG TAGCACACAT CAACCCACTA 

CACAGAAATC CTAGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AGCTTAACGT CATGATTCAT TAGGGGAATG CAAGQCAAAA CCATGATGAG AATOCCCCTA 
GACACCTCTT AGAAGAGCTG CEAGAAAGGC AGACAGCACC AAGCGCITAA ATGAGATGQG 
GGCACTGGTG CTTCTTCTCT GOCTACTGGT AGGGGTGCAG CAGAGTGGTT CAGTCTGGGA 
CAGTTAGCTG GACATCACGT GGACCCAACA CACGCATTTC CTGGGTTACT TACCAAGGAG 
AATAGAAAGC AGC5CAGATCT TTAOVGCAGC TCTTACXTOW TTCCAAAACA ATOGAAAIXSC
CCACATGTCC ACAAACAAGT KTOTCCTCTG CCTGTOCCAT GAAGCACAGT OTGQCTGRQC 360
GTCAAGAGTC CCCACACTCA AAGGAGGCAG CflGATACAGG GCTOCACACT GrGTCATTCC 420
ACACAOCTGA CATTCTGGAC AOJGACATGC TQGATGGCAA AACXSAGCATC GCXXTCAGAG
C3ACTGCTGAG AAGC3GGAACG GCSGCTGCTGG GATGrPGGGTT GATTOTAOCA OTAGCTCATC
GAGATGTGAC CTCAAA

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 434 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

CTAAATGGTG ACTGTGQCl-r TGTrCGAGACA GGCCCCAAAT GGTAGGTOTC AACACAACAT 60 

GCACAGAATG AGGAGACATG CAGAGTGCTG AAATACTGTC CTCGACAGAT CTCTTACATC 120 

ACTTTCTTTT CAGCTTATTT GTGTGGCCTG CCTTTGAAGA TAGAGCTTTO TTCATATITA 180 

CATTAAACCA AATTGTATAA VTATCTTCCA TTCTGACATG TEATTTAGCA AARGAAAAAR 240 

QAGTAATTCT ACATCAGCAT CTTTAGTGCA TGCTAAAAGA TEAAAAATGT CinTOGGGA 300

ACATGTTTTG TATACATAAA TGTTTAGATA GAAATATTTA TAGAATNCTC TATOIGAGTA 360 

TTOATCTCCC TA2X3TATATT TATATCTAGA TGrTGTCAATC TriCTATrGA TA3GAAATOC 420 
TATGAATAGT GAGA 434

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 715 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CCACGCGTCC GATCTCACAG CTCOGACACT ATTGCGAGCC ATACACAACC TCGTOICAGG 60 

AAACCTTACTC CCAAACTAAG CCCAAGATGC AAAGTTTGGT TCAA3GGGGG TTAGACAGCT 120 

ATGACTATCT CCAAAATGCA CCPCCTQGAT TmTCOQAG ACTTOGTCIT AITOCTITIG 180 
CTGGOCTTAT TGGACTCCTT TTGGCTAGAG GTTCAAAAAT AAAGAAGCTA GraTATCCGC 240
CTGGTrrCAT GGGATTAGCT GCCTCCCTCT ATTATCCACA ACAAGCCATC GTCTTTOCCC 300

AGGTCAGTGG GGAGAGA3TA TATGACTGGG GTTTACGAGG ATATATAGTC ATAGAAGATT 360 

TOPQGAAGGA GAACTTTCAA AAGCTAGGAA ATOIGAAGAA TTCACXTOGA ACTAAGIAGA 420 

AAACTCCATG CTCTCCCATC TTftAIXIAGTT ATAGGTEAAAC ATTCGAACTC CATAGAATAA 480 

ATCAGTATTT CTACAGAAAA ATGGCATAGA AGTCAGTATT GAMGTATTA AATOtSGCnT 540 



CrrCITCAGG AAAAACTAGA CCAGACCTCT GTTATCTTCT GTCAAAICAT CCTACAAQCA 600 
AACTAACCTG GAATCCCITC ACCTAGAGAT AATGTACAAG CXnTAGAACT CCTCAITCTC 660 
ATGTTOCTAT TTATGTACCT AATTAAAACC CAAC3TTAAAA AAAAAAAAAA AAAAA 715 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GAGCCAGTGC CGGOGAAAGG GGACCTTCCT CTACTTCCTG CCACAGACCC TCTCCCCACA 60
CACTTCCTGC CCCTC3CTCTG CTOGGAGGCC ACITCCTCCC CCAGTQCK3G A1TCCACCCC 120
CAGCTCACCC TCAAACATGG COCCCTCTCT OCTOCTGCTT GCCCCTCrCT GCTCCCIGGA 180

GGCTGTTCTG TCCTCCCCTC TTGAAAAGCA ATQCCAGCTT CCTOGQATCT TCTOCCAACT 240 
CCAGCTACCA TGCCCTTTGC TCCTGTCAGC TCAGCTCCTC AAGGGAA3TC TCTA^CCTOG 300 
C5TC5TCCTGCT TCCCTCCCTC AACCTCCTCA CCCTGCTCCA AGCTQGCAnC TGCCCCTCCA 360 
CTGCACAGAA CGC3OTCCCCC ACCACCTOCC TTTACAOGGA GGAAGCAGCA ACATCGAAGA 420 
ANCGAACTAT AOGGGCTACA ANGATGCTCA GCTCTGATCC CGAAGGCAAA AAGNATCITT 480 

486 



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 725 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

GTrcCPCTGG TAATAATTAG GnTATTCCCA GAAGCACAGT GTCATrcPrT AAATAAAAGC 60 

TTTOCTGTTT AAAQCrPTTC AAAQGAOCAG ACCACCTTGA AGATPOCCCC TA(3CX?nGAT 120 

ATGnCTCTAA TTCATTTrAT AAAAATTATT CTTCTCTrCA TTTTAAAGCT TTOGCTATAT 180 

AGTCAGAAAT C3TCCTAAATA ACAAACTATT TICTAnTAA TITAGGGAAG ACTAAAGGGA 240

AGAAAAATGA AAACTCAGTC TTTATGTAACS CTCCAAQGAT ATTAGGGCTT AAAGGGCTTT 300 

TCTAGTTITA TSAGAATTTG TACTACTGAT TTTEATATAT TCCTCTTTTT GATCAACAGA 360 

TCTCTQOGGA AATTGTTGAG TTACAATGGC ATTTCACTGT GATOCCTCTC AAGCTCAGAT 420

CAGriTCTATA ACCCAATCAC AACXriCTCTC TTTGGTTTAC TCTCCTGTGA AATOTCAGCT 480 

CAAGrmccC AGAAGTOGrrc TOTTTATGAT GAGICAGAGT GCTTTTCCTC GC31X3GGACAG 540

TTCCTQGCCC TCTTAATTTT GGTOTATOTG CTTOCAAGTA TCTAAACCTC CAGTCTCATC 600 

ATCCTAACTG TTAATTGTAT TATTGATTAT GTTGATTATC TTOCTTCAAG 660

GTPCATACTT ITCAAriTGA TAGAAATAAA GrrTTTTTTCT GCTTATAAAA AAAAAAAAAA 720 
AAAAA 725

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 437 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CACACAGCAT GCTGCCCTCA GACGTGTCXIA TCXrTGTACCA CATCAAAACG CTCCTCCTCC 60
TQCAAGATAC TGAGAGATTG AAGCATGCTC TGGAAATGTT CCCAGAACAT TCCACGATGC 120



CTCCTGCTTT TATTGGCTCT TGTCX3AAATC AAATTGGAAG ATCTPCAGrK: CX!AGCTGCAC 180 

CCAACX3TGGA AAAGTATTCC AGGTCXMCC CCAAGGAACC AACACCGATC ACATCGACnC 240 

AGQAATCTTA TAACCTACGT GGACTCTTTC CATCCGTACA TTCTCGTCCA CATGCCACTC 300 

ATCACTTGGC GTGCCCAGAT CCTOGCARGG CAACACCCTG TGATAATTCC AGGTCATTCT 360 

CTACATCTGC AGCTTGAGGT TAGCCTCATA TCACA3TACA TTCTCACTAN AAACNAAAAA 420
AAAAAAAAAA AACTCNA

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 943 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

GGCAOGAGCT GGAACAGAGA CTAAATCCCA CGAAACTCAC ATTC3TTAAAC ACACTAAAAC 60 

AGAAGTACTT AOCTCITCAA GATTTAATAT ATAATOCTTTG ACATGATACA TOTACAIGAT 120 

GAATGACCAG ATCXTTATGG TCTACATTTT CCriTATCCT GTTAGTATTA CCTTCCTTAA 180 

TCTTTGTTCA TTAACATGCT AATTCCTCTT CAGTGrrPTAT TTTCTAGfTGA CAGAATGCTA 240 

ACAITTCrrA CACCCTQGCA GAAGGGAGAG AAATGTGTTT TCXSGGrrOGGT AACTAAATTT 300

TTGAOTGAAA TATCATAAGA TGANAATGQA AANAAQGAGA CACAAANAGT TATWACAAAA 360 

AAACAATGGT TTTTTTAGCC ATTTGACTQG CTCTTTAAAT AGTCTACAAG ACATTCACCT 420 

TTAACATCAC TTriAGTGAA ATAAAATGTG CCATACTAGT ATCTIGCTTCA AAAGGQCAAA 480 

TGTGCTTTAG TGCCCTAAGG CTAAATTTTG GTCATTTGAC AICAGAGATC TTCTAACrTAT 540 

TGCACITAAT ACQCACCTAT TTOTCAATAG TGTrATTTTT TGGWTAGCAT TTTTTTTACC 600 

ACTAIOTTGT TGATAGCTTT TTGTTCTNTO AGGTPCaJAAN ATGACACyTGC TMATNTCAAA 660

CAGAOTACCC ATNK5CAGAA CTAAGGGAAG CNATTTATGT ATGAAAGNAA TTtfTTCSAATT 720 

tKSTCATTOITC AACCNTTCaiA TTAAAGCTTA GACTAAATAG TAATATAIWG TC3GGNAGC3AT 780 

TTTCSGTTrTG TGATATTINT GTGNATTAAG GMATAGATGT TAAGCNTTAT TTTGTAGNAA 840 

AGrPGANTTGT ATGTQGITAA TTATAAATAA AACTGGTACC AGGNAAAAAA AAAAAAAAAN 900 

KAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 7WVA 943 

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 604 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
GGCACGAGAA ATCTTCATGC TGTAGTCACT CCAGACCATG GACTGGCnT CCAGCTGAAT 60 
GAATCCTATG TCTCGOCTIGC AGGTGGTTGG TTTTCAATGT TCTTGCTAAT TTTTTTTCTA 120 
TOSGATCTTG CSGAGmTCT TTGTrTQCTC CTCTrfflTPGC CCftGCTTTAA TAAAACCAGG 180

CC3CAAACAAA AACCATAGCA rrCTGAflCAA TAC3GGGGCCC ACATTCGACC CACSmTOTCA 240 

CTTTAATGGA CTTCAACSAAA AAATCTGAAT GCX3AAAAATG ACACTAGC5AA TCTATACTCC 300 

ACACATTTTA TOCX3VTATAA TOJrGTOTIT TCTTAflTTIT G' iT KJ TitJUJ GCCSAAAatTTC 360 

GCTTTCAAAT TAAAATGACC 'i' m V nClT TCAAACTTIT TOPnTCACT TOTAXWOTA 420 

AaSGTITCGA AAGA3TCATA ATTCTGAGAG AGGTITOCAA OCAGGAGATA CAAAGAAGTC 480 

TCAGTAGTAA TCTTOTTCAT GTGCTTTrAC AGCCAGCTAC A3TEAAGGAT GTATTAGITA 540 
CAGAAATTAT ATCTCTGTGT ATGrPGTCTCT ACTCAATAAA GTACATOCCT CCACAAAAAA 600
AAAA 604

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 349 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

GTGAffTQCCC GGGAGCCCCG AGGCCCTGCC CCTAAGAAG6 ATATCTYTRA CCGCTCCCTT 60

GTCCACACCC TAACCCCCCA GCTGCTCAGG CAGTQGGCAC ATCGCAGGGG CCTCACTCGG 120

GGCACATAGA GCATITGGQG GACTGCGAC3T GCTCACCTTT GACTTCCTOC AGGTCGGGGG 180

AAAACCAGAT CAT6ATGACC AAAGTYTACA TATPCTTGAT CTTCATOGTC CTGATCCTOC 240 
CCTCCCOXSGG TCTCACCAGG TATATGCCAC CACYTTCTGY TCTAAATTCA GAATAAGAGT 300 
CACATCAGGA GAGCACTGrrc OCCAOGANAA TGCAAACGGG TTCGCAGCA 349 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 672 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
GTAGTOGTTG OGGTTGCCGG GATGGCGAAG ATCTOGCCGT ITCAAGTCCT AAAACGCACC 60 
TOGGTACCGG TGCnCTTGG TITOGTGATT GTWATCGTTG CTACAGAGCT GATOGTOCCA 120
QGAAOQQCAG CfiGOOGTCAC AGGCAAGTAA ATAGTAATQC CX3GAGCAAGT ITCCICOGGC 180

TTTATCATGT CACOCACTGT GGTATATGCG nCTOGTCTG CCAACTTTCC CGTCAAOVAT 240 

TTCAGCAATA ATCAGATC3GC GGCTQGOGCA ATATTCAftGA TAAOCSCCTOG CAGTX3GTOCG 300 

GCTGATQGTT CAGTOCCTGC GSCACCGrTTT YTOCCGTMXi TTGCACACCA GGNTCTITAA 360 

ACAGTTTTCG SACCGCGTIT AGCGTCAAGG GTTCAATGCC GGTCGGTAGC TCGTCCrrAG 420 

GTTCACOGCG AOCATAAGCA TTAAACATCT CATCAATTTG CTTCTOGCTC GCGCTA1CAA 480 

TACTTTCCAG CAITATGTrTA OSCTOGCGGA AAOQQGTrAG CSTTIGCCCC ARCMGWTCAT 540 

AGGCAATGGG CTTAATGAGA TAATCAAATA CACCACAACG TACGQCITCA GACACayriT 600 

CXrATATOSCT GGCTGCAGTG GTAAACACCA OGTOGCOGGG ATAATGCX3CC TOCACCACJIT 660 
CATQCAGTAA AT 

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1908 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

AGACTTTGATA TmTAGAAA CAGTAATTTT ACTTTTAAGG AAATTGGCTA GCTCTTTCAC 60 

TONAGAGCTG TAGGAAGCTC AACATTTCTT TGTAGAGAAC GTTGCTTnT TTOGATIGTA 120 

CAGGTATAAA AACATTQCTT TTGTTGAATT GTATAGGTGT AAAAAQQGAA TAACIGTATC 180 

CAGGTITCAA AAQGAAATCT GCTTTAGGCA TGAGTCATAA GATGCCATTC TACXTOEAGG 240 

CATITEATTT TCCTITAGAA ATGGACATCA GCTCTTCTCT TCTCACTGGT AACACATAGC 300 

CCCAAAGCAT GAGATTATrT TTCATTGGGT TTTTATTGTT GTTTAGTTTT GGTTTOPTAC 360 

C3CCAQCCCAG TCICTCTGCG GAACACTGAC TCTGCTCTCT AATCAGAACA AAGTTAGAAA 420 

TCTGCCGATA ACCTAAAATA ATTTAGAAAT GAATTAAAAA TGTGAAATCG GGTTAAAGTC 480 

ATGATGATAA AATAGCATGC AAGAAACAAG CTCCITCCAT CAGACTTQGC TACKJrrnC 540 

TTCTGGTACG ATTTGGTTTG GAAGAGCCTC TTGTrTCCTT CTCTTK3GGG TATCTCTTCG 600 

TTTCTTAATA TC3TTTGrrAAC ATTATTGAGA TATAATTCAC ATACCTTACA ATTCACTTAT 660 

TTTAAGGGTA CAATTrAGTG GTTTTTAGTG TATICACAAA CSTTOTCTAAC aTTCACCACA 720 

GTCAAirrTA GAACATTTCG TTACCCCAAA AAGAAACCCT GTACCCITCA GCAGTCACCT 780 

CTCATTTTCT OOCAGTGCCC ACCCCATCCC CGAGCCCCKG GAACCACTAA TCTATTrCTC 840 
TCTCTGTftGA TITOCTrarr CTOGTCATTT CATATAAATG GAATICTACa ATATTCGCTC

TTITOGGftCT CSQCmxxaA AiaTGMTrT CTATftTOGAG -TOAGAAAATT CTTCTCAaCT 960 

TGAGAMTTCT TArTGCTOTG AAAGGGAtSTG GITOGrTAAAA TCAATACSArr TCAGGCAAGA 1020 

QGGCXaGftTA CCTAACACX3T TTITCTCOCT GftATCTTATG CICACTACTT nTCCTCflTA 1080 

ACCAflGC»Tr TATCATATAT TACTACTTAT AATACTGTOG CTAOTCTCTA GAATOGAOCT 1140 
TOAAMCTIT GCCTCXriCAG TOGGGAft^ 

CAAAAATCAG GCX^AATOAC TTQQCAAATA ATTGACAAAG TOCTmCAC CTtJICTCTAT 1260 

crrrocTAGC agcttctata cctcaggcca ggtgagctcc ccaaattict -nrraMrr 1320 

ACTCCAGTGA GTTTCTGCTG TCTmTCAA GTATGTACCA TAGGACTTAA MGTCA7TR3 1380 

GATGOfflTGT AACACTGCTA AATAaOCTAA CTACAGAATT TTAICTACAG TACTOTGAGA 1440

CAGTCAATTA TTGCCTAGGG TAGTTCAAAA ATATGATGTC AGCTBCTTAA GCCTITCCTT 1500 

GACTGATITC ACTQATArrC AGAflGTSTGT ACTAATCAAG GCICTITAAA ATACGGAACXJ 1560

ACPCACrrAA TAACXaOGGA ACCAGCCAAA TACTQTCCAG COGCAGAATA TCCATATCAA 1620 

TGAGITOGAG GTGACTATrC TCICTAACTC CCTAATGATT GmTCTAAG CAITOIGQCT 1680 

TCTCAGTOGC TTGACAGCAT CTOX-imjiT GTATGTQGCC TOTITACATC ATOTATTGAA 1740

TAATOTTGTr TGTTGTGAGC ATCAATGCCT GTAACACCAA ACTAAACACG TOmTTOGG 1800 

A-EATOPTTCC AATCPTTAAA TOAOCTTQCC CPGTCCAATA AATAAATGAT TOICTCAOCC 1860 

TGITAAAAAA AAAAAAAATT AAAAAAACTG GCajQQQGGGC CCQGTACN 1908 

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CCTCAAAAAA AAAAANGAAA GGAAAGAGGT CTCTACACAA OOCCGIGATT CITCATOGCA 60 

AC3GGATAACA TCAGAAATGT TTCATTITCK GCTATTAGTT TCCAITCCTT TCCCCATCCA 120 

GGCATAABGA GAAACAAAAG ACAMGATGG TATTCTCTOT GTCCTCAGCT TTOGCACITr 180

TGrPGATt?rr GCTAAGGAGC AGTGACCTTG CTAAAAAGAC TCAATAATCC ACCCACTCAA 240 

TAGCTAACCT GGGGAGGAAA TGAAAATTTC CTTTGTGGAT CTCCCCAAAT CCATTOITOT 300 



CACCAGGCCC TCCCAGAACC TCCTCAGTTC CTTCACAGTG CAAOCCTOIG XACITCGCCC 360
GCAACCCAAT AGTAITCTOC CTCACTTCAC CTTCCATGGG CAACTGCCCT CCXTITCTOGA 420
CATAAAACCT CATATTTTAA ATMAAGITGA AATTTGAA 453



55 



AAAAAAAAAA AAA 

60 



180 
240 
300 
360 



(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1153 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
GGCACAGAGC CTCXX3ACCCA GGTGGTCTGG AGCCTC3CCGG GAGAGTOC5TC GCATCTOAGA 60
GC3CT0GTCGT GGACTGTQGT TGQQGGAGGT QSaAGCTGTT TTAACCGTGT GCCCCCTCTC 120 
CICTGCCGGC GTGGGCATCC CCOGGGGCAG TGGAACCCGG CSCGCTCCTCC AGCTTCOGAG 
TCCAGCCAGC CTC3GGCGCGG GGCGCGCCCC GAGACACCCG AGGAGTCCCT TCCTCCCTOG 
TTACGTGGAC TGTQGAGCTG GTCTCTTGTG GCTCAGCGCC GnXSCGGAGCT TGAAGCGTAC 
CTC3CGGAGGT CGCACCAGC3G CGTGAGGAGG AGGflGGAAGG GCATGAGCCG AGCTTCaOGA

ATCCGTGCTC CAAACTCTAC ACTCAAQGAT GCACTQCGCA ACTCTGGTOG CGATCQGCTC 420 
GGGCAGATGT CCTTQGAGTT CTACCAGAAG AAGAAGTPCTC GCTGGCCATT CTCAGAOGAG 480
TGCATCOCAT GGGAAGTCTTG GACGGTPCAAG GTQCATGTGG TAGOCCTOGC CACGGAGCAG 540 
GAGCQGCAGA TCTGCCGGGA GAAGGrTGGGT GAGAAACTCT GCGAGAAGAT CATCAACATC 600
GTGGAGGrTGA TQAATCGGCA TGAGTACTTG CCCAAGATGC CCACACAGTC QGAQCJTQGAT 660

AACGTGrrTG ACACAGGCTT GCGGGACGTG CAGCCCTACC TGTACAAGAT CTCCTTCCAG 720
ATCACTGATG CCCTGQGCAC CTCAGTCACC ACCACCATGC GCftGQCTCAT CAAAGACACC 780
CTGCCCTCTG AGCGTCGCTG GATCTCTGQG AGCTCCTTGA TQGCTCCCAG ACCITCGCTT 840
TTGGGAATTG CACTTTIGGG CCTirGGGCT CTGGAAOCTG CTCTGGGTCA TIGOTGAGAC 900
TTGGAAGGGG CAGCCCCCGC TQGCTTCITC GTTTTGTGGT TCCCAGCCTC AGCTCATCCT 960
riTAATCTTT GCTGACGGTr CAGTCCTGCC TCTACTGTCT CTCCATAGCC CTGGTGGGCyr 1020
COCCCITOT TCTOCACTGT ACAGAAGAGC CACCACTQGG ATGGQGAATA AAGTTGAGAA 1080



CATGAGTTTG GGCTGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1140 

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1983 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

GGCA0C5AGAG GGGCCGAC3CC GACAAGATC?r TCTTGCTCSCC TCTTCCGCSCr GCGGC3GCGAG 60 

TAOTOGTCOG AOGTCTOOCC GTGAGACGIT TCGGGAGCCG GAGTCTCTO: ACCGCAGACA 120 

TGACGAAGGG CCTTOrm'A GGAATCTATT CCAAAGAAAA AGAAGATCAT GTOCCACACTr 180 

•PCACAAGrPGC AGGAGAGAAT OTTGATAAAT TGrTAGCTGG AAAGCTCAGA GAGACITTCA 240 

ACATATCTQG ACCACCTCTG AAQGCAQGGA AGACTCGAAC CTtTEATOGT CTCCATCAGG 300 

ACTTCCCCAG CGrTOGTOCTA GTTQGCCTCG GCAAAAAGGC AGCTOGAATC GACGAACAGG 360 

AAAACTQQCA TGAAGGCAAA GAAAACATCA GAGCTGCTGT TCCAGCGGGG TCCAGGCAGA 420 

TTCAAGACCT GGAGCTCTCG TCTGTGGARG TC3GATCCCTC TO6AGACGCT CAGGCTCCTC 480 

CGGAGGGAGC GGTOCTTQGT CTCTATGAAT AOGATGACCT AAAGCAAAAA AAGAAGA3XX; 540 

CTGTGTCGGC AAAGCTCTAT QGAAGTQGGG ATCAGGAGGC CTOGCAGAAA GGAGTCCTCT 600 

TTGCTTCrGG GCAGAACTTG GCACGCCAAT TGATQGAGAC GCCAGCCAAT GAGATCACGC 660 

CAACCAGATT TGCOGAAATT ATTGAGAAGA ATCTCAAAAG TGCTACTACT AAAACCGAGG 720 

TOCATATCAG ACCCAAGTCT TGGATPGAGG AACAGGCAAT GGGATCAmC CTCAGTOTOG 780 

CXAAAGGATC TGAOGAGCCC CCAGTCTTCT TCGAAATTCA CTACAAAGGC AGCCCCAATC 840 

CAAAOGAACC ACCCCTGGTG TITOITOGGA AAGQAAITAC CmGACACT GGTCCTATCT 900 

CCATCAAGGC TOTQCAAAT ATGGACCTCA TGAQOQCTGA CATOGGAGGA GCTXXAACTA 960 

TATGCTCAGC CATOGTCTCT GCTGCAAAGC TTAATTTOCC CATTAATATT ATAGGTCTOG 1020 

CCCCTCnrc TGAAAATATG CCCAGCGGCA AGGCCAACAA GCCOQGQGAT GTrcriTAGAG 1080 

CCAAAAAOGG GAAGACCATC CAGGTTGATA ACACTGATGC TCAGGGGAGG CTCATACTGG 1140 

CTGATGCGCT CTGTTAOGCA CACACGTITA ACC0GAA£3Wr CATCCTCAAT GCCGCCACCT 1200 

TAACAGGTGC CATGGATGTA GCTTTGGGAT CAGGTCCCAC TOGGGTCnr ACCAAITCAT 1260 

CCTGGCTCTG GAACAAACTC ITOGAGGCCA GCATPGAAAC AGGGGACCGT CTCTOGAGGA 1320 

'roCCTCTCTT CGAACATTAT ACAAGACAGG ITCTAGATTG CCAGCrTQCT GATCTTAACA 1380 

ACA3TGGAAA ATACAGATCT GCAGGAGCAT GTACAGCTGC AGCA3TCCTC AAAGAATTCG 1440 
TAACTCATCC TAAC3TOGGCA CATTTAGACA TAGCAGGOGT GATQACCAAC AAAGATGAAG 1500

1TCCCTATCT ACC3GAAAGGC ATGACTGGGA GGCCCACAAG GACTCTCATT GAGnCTTAC 1560 

TTOCnTTCAG TCAAGACAAT GCrTAGlTCA GATACTCAAA AATGTCTTCA CTCTOTCTIA 1620 

AATTGGACAG TTGAACTTAA AAGGmTTG AATAAATOGA TCAAAATCTT TTAACGGAGA 1680 

CAAAGGATOG TATTTAAAAA TCTAGAACAC AATGAAATTT GTAIXXXHTC AT XTmUTr 1740 

CATTTCACAC AAAGATTTAT AAAGCTAAAG TTAATATCTT ACTTCATAAG GATITrrAAG 1800 

ATACTCTATA AATGATEAAA ATTTTTAGAA emXTAATC ACTTTTCAGA CTATATGlTr 1860

TTCATTGAGA AGCAAAATTG TAACTCAGAT TTGrTGATGCT AGGAACATCA GCAAACIGAA 1920 
AATTACTATG CACTICTCAG AAACAATAAA TGCAACrTGT TGTOCAAAAA AAAAAAAAAA 1980
AAA 1983

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1406 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

ATGATGA31GA CTTTGAAGAC GATTTTATTC CICTTCCTCC AGCTAAGCGC CTTCAGGITA 60 

ATAGnTGGAA AAGACTCTAT AGATATT6AC ATTTCrrCAA QGAGAAGAGA AGATCAGTCT 120 

TTAAGGCTTA ATGCCTAAGC NCTTGGTCTT AACTTGACCT GGGATAACTA CTTTAAAGAA 180 

ATAAAAAATT CCAGTCAATT ATTCCTCAAC TGAAAGTITA GTIGGCAGCAC TTCTATTCTC 240 

CCTTCACTTA TCAGCATACT AITCTAGAAA GTOTACAGCA TACTGACTCA ATTCTTAAGT 300 

CTGATTTGTG CAAATTTTTA TCGTACmT TAAATAGCCT TCTTACGTCC AATTCTCAOT 360 

TAGAGGTAAA GCCCTGTTGr AAAATAAAGG CTCAAGCAAA ATTGTACAGT GATAGCAACT 420 

TTCCACACAG GACGTTGAAA ACAffTAATGT GGCTACACAG TrmTTAAC TOTAAGAGCA 480 

TCAGCTQGCT CTTTAATATA TGACTAAACA ATAATTTAAA ACAAATCATA CTAGCAGCAT 540 

ATTAAGGGTr TCTAGTATGC TAATATCACC AGCAATGATC TITCGCTTTr TCATTTAnT 600 

GCTAGATCTT TCCCCCTTGG AOTTTTCTCA GTTICACACT GTTTGCTGGC CCAGCTICTAC 660 

TCmCTQGC CTPrcrrrAAT ATCGCAAACC ATTOGrrrOGG AGTCAGATTC CHTICTEAAA 720 

AAAAAAAAAA AAAAOGACAT ACGTCACAGC TCACTTTTCA GTTCATTATA TGTACCGAQG 780 

GTAGCAGrrer GTGGGATGAG GTTCGATACA GNCCTATTTA TTGCTTGTCA TCTAAATTAA 840 



60 



WO 98/39448



267 



PCT/US98/04493



AAAOCnCTA TTTAACTCTT OTCAATCCTr TTAGATAAAA TTOITCTITC CAAGAAItaT 900 

TGC3T0CTTAT TTITTCAAAA AITPGCTGTC AACAACGTGA TCACAACAAG CAACATrTAT 960

CTAA3GAACT ACaOCTATCT TAATITOGTT CTTCAAGlTr TCraOGCAC ITOEAAAAIX; 1020 

CTACAAGC3AA TATTAAAAAA ATCTATICAC TrTAACTTAT AATAGTTTAT GAAATAAAAA 1080 

CATGftGTCAC AGCTTTPGTT CTGTCGTAAC CTATAAAAAA AGnTOTCTT TCftGATTCAA 1140

TCTAAAGAAC TGAAAACAAT GTEATATCriTG TAAATAITTC TOIGTTOIGA GAAATITITC 1200 

TCATAAGAAA TTAAAAGAAC TTACCAGGAA CCTTTTTAAG TTAGAAATAT TCCATCCCAA 1260 

TAAAATAGGA AATTATAAAT ATATAGTrTT AACXCTCCAT CACTOGGAGT CTTOGCTATC 1320 

TAGTTATCTA GTTATTArai AACCACCAAG ATrmTTCG CTATITACCG TAACCAAAGG 1380 
GGCCGATTAA NTGGTTTGAA GNCTTG 1406

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1391 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

GCX3CCTGAAG GCGGCRCXXX: AG3XXCGACC AGTGCTCGCT CCTGCTCGGG GCGCTOCGCX: 60

CCCGGGCGTC GCCATGAGCA C3TGAGCTGGA CATCTTCGTG GGC3AACACGA CCCTTATOGEA 120

CXaGtSACGTG TATCGCCTCT GGCTCGATCG 1TACTCQGTG ACCGACGCX3G TCGCCCTCCG 180

QC3TQCX3CTCG GGAATCCTGG AGCAGACTQG OCSCCACCSGCA GCGGTCCTGC AGAGCGACAC 240 

CATGGACCAT TACCGCACCT TCCACATGCT CGAGCX3C3CTC CTCCATOCGC CGCCCAAGCT 300 

ACTGCACCAG CTCMCTTCC AGATTCCXXr CTCCCGGCAG GCACTACTCA TCGftGAGCTA 360

CTATGCCTTT GATGAGGCCT TTOPTCGGGA GtSTCCTOCSGC AAGAAGCICT CCAAAGGCAC 420 

CAAGAAAGAC CTGGATGACA TCAGCACCAA AACAGGCATC ACCXTTCAAGA GCTCCCX3GAG 480 

ACAGTTTGAC AACTTTAAAC GCXTTCTTCAA GGTOGTAGAG GAAATCCX3C5G GCTCCCTOCT 540 

CXaCAATATT CAGCAACACT TOCTCCTCTC TGACXXOTTG GCCAGC3GACT ATCCAGCCAT 600 

ccjrcrrcTTr gctaacaacc gctitcagac agggaagaaa AAAcrccAcrr atctcagctt 660

CXStSTGACTTT GCCTTCTGCG CTGAGCTCAT GATCCAAAAC TOQACCCTm; GACCCCTTCGA 720 
CTCACAGATG GATGACATGG ACATQGACTT AGACAGGAAT TTCTCCAGGA CTTCAAGGAG 780
CTCAAGGrrcc TAGTOGCTOA CAAGQAOCTP CKX3ACCTCC ACAflGAGCCT GC?K7roCACT
GCTCTCOXSG AAAGCTQGGC GTCITCTCTG AGATGGAAGC CAACTTCAAG AACCTOTCCC
GGGGQCTC3CT GAAOSTOTO OCAAQCTGAC CXIACAATAAA GAroiCAGAG ACCTOmCT 960
GGACCTOGTC GAGAAGTITC TGGAACOCTG COCXTTCOGAC CACTTOGCCAC TCAGCGAOCT 1020 

GasGTTcrrc ctgaatcagt atpcagogtc -ictocaatcx: CTCGAIXSGCT TCCCSACACCA 1080

GCXXXnCTQG GftCX3GCTACA TOC3GCACCCT ODGOSGCrcC CTCCTCCGCC TOTATCAIGA 1140 
CTGAGCSTGCX: TCCCAAC3GTC CGCCrAOCT GACAATAAAG TTOCTCTCAG TTreGftGACT 
15 «mXTOXn» 005^^ 

AOCrKSGTGrrC CGTCTACAAG GATQGATOTG TOCNGTGGer CCTTOQQAAC TCftGACATAT 



1200 
1260 
1320 



20 -*«^^-wvL ittiTCTOCTCA CAAAAAAAAA 1380 

AAAAAAAAAA A 1391

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1569 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
CXXIAaSftGTG GAGATGCXTO OGGCCCn^ 

GGCX3QC5CCGG CTCCCTACCC TOCflGACICT 0CGCTATC3GC TCCAAGGCTC TTACCOXCA 
CCGTOC3TGTG ATGCACTTTC AOCGC3CAGAA GCTGATOGCT OTGACTCAAT ATATCCCCCC 
GAAACCAGCC ATCCACCCAT CATGCCTGCC ATCTCCTCCC AGCCCCCCAC AGGAGGAGAT 
AQCXXTCATC AGC3CTTCTCC GCCOGGAGAT AGCAGCAGflT TTCCAGGACA ACCGAATCAT 
AGCCGTCTGC CAGAATGTCG CTCTGAGTGC AGAGGACAAG CnCTTATTC CGACACCAGC 
TGCGGAAACA CAAGATCXTTG ATGAAGGTCT TCCCCAACCA GCTTCCTCAAA GCCCTTCCTC 
GAGGATTCCA AGTACCAAAA TCIGCTGCCC CTTnTOK3G GGCACAACAT GCKXTOCm:
ACTCAAGAGC CCAAGC?rCAA GGAGATGGTA CGGATCTTAA GGGACTCTCC CAITCCTOCC 
GCTQCTAGGT GGCIGCAITG ATGACACCAT OCTCAGCAGG CAGGGCTTEA TCAACTACTC 
CAAGCTOCCC AGCCTGCCCC TQGTGCAGGG GQAGCTTCTA GGAGGCCIOV CCTCCCTCAC 
AGCCCAGAO: CACTCCCTCC TOCAGCAOrA GCOCCTtX^ 
GTACATCAGA GAGCAACGCG AGRAAGGATT CTGTCAIXJTC GGCCAATO3G AAGCCAGATC



60 
120 
180 
240 
300 
540
600 
660 
720 
780 
CTGACACOCT TCCGGACTCG TAGCCAGCCT C3TTTAG0CAG CCCTCCGCAT AAATftCACTC

iGCGrTAOTG GcrorecrcT cctcaatggg acatctosaa gaacitoqgg tcggqgagtg 

TCTITGTCAC TTCGTTTrCA CTAGTAATOA TATICTCftGG TATAGGOCCA CTTOGAGATC 
CAGAOGATTC CMTTCflGAT GTCAC3TCACC GGCITCGTCC TTACTmTCC CAACITOC3GA 
CGTGATAGGA GCAAAGTICTC TCCA3TCTCC AGC5TCCAAGG CAGAGA3XXT GaWU^WMAG 
GGCTA3TOPC COCTCCCTCC TOXTCACTG CCTCTTOCTC CACGQGCTCC TCAGCCCACT 
CrCTTOGGGC ACAACCTCOC ACrGCX:flCAG TAGCTCAACC AAGCACTICT GCTCAGftATC 1200 
GCACCTGCTG AGAGCCTGCT GTOTGCCAGG AGTOCICTTA CATOTATTM; 1260 

'ITCCTrEACr GCTGACCACA ITOTACOCAT ITCACAGAGA AGGAGCAGAG AAATTAflGTO 1320 
CXTTTQCTCAA GGTCATOCAG TTAGTAAGTC GCAGAACAGG GACTK3AACC AAGCCCTCTC 1380 
CTCTGAAGAC CGCGTCCTGA ATTTCTTCAC TAGAGCTTCC TCATCAGGTT ACCCAGAACT 1440 
GGC3TCCCATC CACCATCCAG GTGTOCTTCG ATGTTAGTTC TCCACCCTCG AGCHCTACGC 1500 
TOTGAAAAGT TTGGGAGCAC TOCTITATAA TAAAATGAAA TATATTCTAA AAAAAAAAAA 1560 



1569 



60 
120 



(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1924 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
«3GCCCCCCC WaSWKTTTTT TrTTTTTTTT TTTAATTAGG ATAAIXXXTTT TATTAACGAG 

AATGAAAOGT TCATTCCTCC TTCCACTCCT TCTCGTTQGT TTTCTOGACA CAGCTCACCT

GATCCTGCTA GAAACGTTGT CAGTCTGCTT GTGGCTTCCC TCCTOGAITC ACTCACGCTC 180 
TOPGATC3TCT TGAGAAGTAT CTATCCACTT CATGTGAATG AGCACTCCAA TAOCAGCCAA 240 
CATCAATCAT TCTTACCTAA AGAATAATAA GAAAAAGTTA ATATAAAAGA CAAGGGTATA 300 
AAATAAAGGT TTGAAAATGC TAGTCAACIT CAAAATTTAA AGACTAAAAA TOIAGAGATA 360 

AAGATTQGGG GTAAGTTACA GCATAAAAAA ATAGGAAGAA ACTTCATCGT GGGGGGGAAA 420
TCTAAAATTA TTCTTACATA AAATAAGTAG ACACCTGAAT TAGAATGAAA ACTOTAITrr 480 
CTTTAAAATG TAAAAGCCTG ACTCTCAGTT TCACCAGTCT GAGCACAACT TTCACTCCAA 540 
COCAAAATAT«TAimrrPAltn«AflGCTATC^^
TmACAK:Aa:iCTlTm..CATATCAA^ 

5 ™CA™nx:AQQCAo«n«A^ 

CmXX3«3«5 TACACAAOK TtJB^^ 

-mCOCMCr AAGTOCTCAC TCACTPITAA ACrTAGOtSCT GGAAAAGCTA TOCCTTACTC 840 

GGACflGCAAG GAAOCAArtT TmCTGAGG GAGAAGACAT TCACCTICfiC IMATCCCIG 900 
«»«3GGCX:aC A<n«atfaAA ACAAAGAT^ 

TOCCTGAATG ATTACTQCAA AGGGTATATG AACTAAGAGT TCCCTOITCC ACATCTACXS. 1020

TCCATAAGGG ATACTATATC GnTTGCArr CTTCCCCCCA TOTCCACAT ItJTCCTATCT 1080 

2^ cccrmcAc TCT^^ ^^^^ 

tTCAAAAGCA AOGnTITAT OSITAATQGT TTACCAGCAA CRJITCAGAT TTCCAGITCA 
CHCTEAAAAA TTGOCAATCA TTATCTAGCA QCAATOACAG ATCATTAGGA GCAGTCAAAT 
25 CCTCTGAATT CTTTCCCTAA TAOGC^ 

AACATTATCA GCTAAAGCCA AAACX^AATA AAGGCCCAGA CCAACA-TCCT GGCTCTCTAA 
^'^^^^^AAATOVOTAACJIGAAAGG^ ^^^^ 
CMC«aCAA TGATTAACAA TAGGAGfiCAT GCAACCCCCA ITAAGtJrrAA AA^JTCCAAAA 
CTAGTCnCAC GCATCTCTtT ATTGGGGAAA AGTGAGACTA TrATCCATTC TTOGTAQGrT 
TCCAACCTTC CATOAAGAGC ACCCMTGCA TITCmCAT CTTTCAGAAA GCACCGSTAT
CronCCAAG GGCXTTAACAG TAOGAAAATA CATTCTOQCA TCACACCTCT (aACCX3«GA 

^ ^^'^'^^^ ™aaaataat o™^^ ^^^^ 



1200 
1260 
1320 
1380 



45 



50 



TOGGTATAGC AirATTACTC AAAOCAdTA ATOXCAGCT TTITCAGITr TTTAAAAAAA 
OOCACTQCAC TAAGAITCAC AATTCAOTCC TACATACAAA TTAAAGCTAG TAAGAACACA 
CTAAOGTCAC AAGTITCTCA ITCTAAAGTG CAAAAGCCTA ATCATCTCAA AGTCAACAGG 
GTAA 



(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



1500 
1560 
1620 
1680 



1800 
1860 
1920 
1924 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
TOCntnXSOQG CCCAGAAAMC AAGGGACCflG TGAAAACAMC CCXIAGAGACT TCTA^TCOGCC
AC3GAAAGCCA TTGOCAmVC TGMCCCTTG AAGGGCAAGG AGCSGAAACftG TOITACCAGA 
OOCCAGTAAG AACTOCTOPC ATOAAGGftGG GGCXaCClTC TAAGAGACAT CACTACTAOC 
Afi^^CTCTGG TGOCAAA3TC CTOJIX3TCTC rtCTITCGAGA AACCAACCAG ATACATCreC 
TOSAC3ACCCA GGTGGGCACA GAGAAGCX3TG GAGflGAGAAT CTCGGAAGAG AAATOGflGAA 
TAAGCAGCAC AGTCTTATTC ATETCTCTAA ATTCCTATOT AGAAGGCTTCA GICTPAGAAA 360 
TAAAGn-ATT CTACrACSriTC CAAGrTTAAGT GTTICIXJrrr arTCTOOTT CCTOITAGCA 420 
TAAGTAAACT CXCTITOGAA CTACACAGGT A3X?rcTCrcC rTCAACATOT GIX3AA 






GCCCAGCCMA CYTCTGTCAR AATCTGCTTC CCTCCACAGC TCACCC 



(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1366 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:
TAGGTGTCAG CCGOCACCCC CCCCCCATAT GCAGATTTAC TSGGCA-TOCT AGK3C3CCAGC 
TTCTAACACA GCTGGTATTT CAAGTCTCCT GGGACCTCAC TCAGGAA3GA TACCCCCTCA 
GTAGAAGCAG CAGCSTGATCT TAACTCCTTT CAAAGAGCAG CSCCTCTCTOG GAAGCCArECT 



60 
120 
180 
240 
300 



475 



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 346 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

AAGGGACAGA GACCTGGAIT CAGATCTCAT TTTACAATCA AGACCCCAAT GCAGAAAGIC 60 

ATGTCTC5AAA TTCTGAGCTT AdCTICTGC CTGCTCQGAC CTCCTCTCGA TCAGAGAAGG 120 

6AGGAAAAGG ACTAATCAGA GGAGCCAATG AAGTCACTCC ATCAGITICC TCAACCCTCC 180

CCAGCTAGAG ATTAACGTYT GAOCWPCAAC CTAGGACACT GTOCAGAJTOG CTACITOCIG 240 

GOGCACATGA AGACCAAAGC CAGGACCAAG CCOCMASCCT GCTOAACACG GCAGARKnT 300
60



60 
120 
180 









(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1405 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



CCTCAGCAGG CACAGCAACC CCTCTGGAAA TGGATCACAA ACTCACITCT CAGCCAGCXA 240 

C3GCCAAGCTT CTATTGTAAC ACJrAGGCACA CTATAGTCGG ATCATCACAT CAGCTOGC5TT 300 

TrrcGrriTAG tcatctagag tcgtctqgac taaagctctt icAOffrcrcc ttoccctotc 36o 

AGrrGCGTGAA CCTCCCCACC CGAATTGCCT CAGTTCTCCT GAGCCTCATC TCTCTCCTCG 420 

•raJTOGGCCA GGCCCCroCA TGC5GAAGGGA GCCTGCTC3CG GC3GCAGGCCA QCTOGGC3CTO 480 

CTCACCTATG CGCAATGANA GTTATTGAAG CSVCTCSGTICT TGATOITOCT GAGCGrTATCC 540 

ITCATQCXXrA GCC3CX3AAC3TC GGCCAGGTCA GCCAGGTQCT GCCAGCOCTC TCTCTCGGAC 600 

TTGTCTTCCT GTCOCAGC3C3G ACCGTOGAGA AAGTGTCAGG GC3CCC3CICAC TGCAGCAGCC 660 

TGCTCTGCTG CCTTCCCTC5G CAGTCTTCTG GGGGTGGATT CCCTACAMCT AGA3CTICAA 720 

GGccTTAcrr rrccTcccAc aaaggagtcg cagccacgct agctctcact ixsccAcrorc 

ACAAAGTTCA CGTAGCAGG?r CTAGGCAAAG ACTGGGCAAT IGAGCAGAGG AGACGGACCT 

GTCAGrrCTGA CCRYGAGSCG GRCCCCTTCA CCTTOGCTGG CSCTOTrcrTC GIOTrAGCT 900 

TTTGTCAGCJr TffrCCTrGTT TC3GATCOCTC AACTAGGTCA TAAGCACTCG AGC3GGGMXSA 960 

CCCGCCTTCG ACGTGTTPCT TTAACCTCAT CCATATAATA GGCXXX3TOC3G AltSGTTOTAG 1020 

AGC3TAAAGCA CSGATtSATGCST OmTAAGAC CAGAGCTTCG GACCAGC3GCT CCTACACCTA 1080 

ATTTTCTCTC CTGCTAGCTG AACAAAGGTC TAAATTAGCT TAACAAAAGA ACAGGCTOCC 1140 

GTCAGCCAGA GTTCTGAAGG CCATCSCTITC AGTTTCCCrP C7ITCACAA!rr CXTTCTCCAGr 1200 

TCCTATGAAA GCACAGAGCC TTAGGGGGCC TGGOCACAGA ACACAACCAT CTTAGGCCTC 1260 

AGCTGTQAAC AGCAGGGGGT TGrTCTCTCTG TTCTGrtTCr CTOCTTOCCG AACTTTCTCA 1320 
ATAAACCCTA TirCTrATTT ATAAAAAAAA AAAAAAAAAA AAAAAA 



780 
840 



1366 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCAGTAAITC CIOTTAGCCA CTGCATCCAC CAAAACTAGT TTArrTTTCC CCTCAAASTC 60

ATGATTTrTA CGnCTGTTAC AAAGGGAATT TTCJCTGATAG CTCTTTOGCT CCCACTCTTC 120 

CATTTTATGC TAATAGATTC CATTCTAGGG CCCAGCOGTC TCTTCACTCA TCGTCTTCCC 180
TTTAACOCTr GQCATGTATA A-EftGAATTTT GCTTGAATGAA AGAACCCAAA TAGGCCAGAT 240
ACTCXCCCCA GCrOCTGATA TCCATAAAAG GCTTOGGAftT GCATTATCTA AITCTCCTTA 300
(nCTITITCT TCnrrrAGAA AAAAAAAACA AGATOGGCTC AGATOGAIXSC CTACGTAAAA 360
ATGGTTCCTA GCTCTGTACT CATAACTrTT CTTTGAA'TrG AGTAGTGAAA GGEAAGGAGGA 420
CSGAAflGGAAA TTAAATCTCC TTCTAGTATT CTCTGGACTC AAGTCTCACA TATCAGATAA 480
TAACCTATAT TGAAATQCCA AGAATTCTAT CTGAAACAAG AGAACAOTIT GACACATTTA 540
TCATGCCTTC ATATTACATA TEAACTGAAA CCAATTAATA AACATATGAA ATATCCAaTC 600
CACAAGGCAA AGGCACCTAA ACCTTTTGTT TCrmTTCTA CATAGCAGAA ATTCATITPr 660
TTTTrATTrr TTTAGGGGAA CCTATATAAT TATGACCCAG TGATCTCTTT TCCJTCACITA 720
AGCTTATGAA TTCACXTrTAC AATTGAGTTG ATTCTAGATG GTTACTACCT TCAAAAQGAT 780
GTTC5C3TQCCT TATGTGACAC GAGCCAGAGC CTCXTCGQGA ATAAACAAAG CAGCnTTCAT 840
C3CCAACACCA ACTCGTAGCT TTACTGGC3CA GATOGGGAGT GGITCACAGA CTTCCCAAAA 900
TGTGCX5GGCT TTCGOATTIT CCACACCATC CCAOSTOPGT TOITCATTCT TCCTCTTrrc 960
ACACPCTTCG ATGGATWATT TCaiAAATCSGT GRAAWYMMCY YYKRAATITC CXXMTAGCC 1020
WTC3«3CCACC ATTOTTOATG ACACCATAAC CAAATAGTTC CWTAATGITC AAATATTAGA 1080
AACCTGTTAC CAGCCYKSHA KTWACCCWWA WmTCCCAT CSTTTCTOGAA 1TCATA3TOA 1140
AATAGCAC5GG CTAAGGAATT ACTQGCAAGT TTTAGCCTGT GC3CTAATACC TTAGGGTTAT 1200
TTAAATATTT GrrAAriTTAT TTAAATOITC ATGAATOTTT GAAAGGAACA AAATTATCAG 1260
GGATGGCTCT TTOCCATC30G TCTCATTTTC ACCXTrCiriT CTOTAAGAAA AAAGAACAAT 1320
GTCTTAATCT ATTnTAAAG TTTITOGTAT ACTTTCTAAT TCCAATITEA ATAAAACTTP 1380
TWTRTAAAAA AAAAAAAAAA AAAAA 1405






(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:
OGGATITrCT AGGACCCCAA AAAAAAAAAA AGGC31AAAAA AAACCCNCAA AACCANCCAA 60
AACCCCAAAA AAAAAAAAAA TCCACAAAAA CAAAAAAACT ATAAAAAAGA AAGAATTAAA 120
AAcinCAGA GAATTACTAT TTACTTTATT AACTTAOSGA TTTATTATAT AAATATATAT 180
TCACCr«5CA ACATATCTCT CXOTPCrac CTCCTTnCAT AA1X3AAGACA TAGCCCMTC
CCCTOCftGAG GTOGAGGTOG GCTGATQCXX: TGGCTOCCTO GraTTTCATC GmTCCTCC 

cccTACxrrrp TrmrrGAG irrArrcrcA TOmrrr itcitootit ctogataaac 

CACCCrCTOG GGACAGGATA ATAAAACATC TAMSOTTIT AAGAAGGAAA AAAAAAAAAA 
AAAAAACTOG GOGQQQGCCC CGAA 



(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 111 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
NAAGTATCTT GGCCACTTTA TTACAGAGGA CX^ITAAAIT^ TTCCAIXTTCG ATAGGGCATA 
ACATACAGftG AATGAGftCTA TGCCAGAAAT GGGAGGAGGC AITTCAAACA ACAl^AT 



CTCAGGGACA GATGGATTGA TTCTGCTATT GGTAGGCXnX5 GAAGCAANGG TCAGAAGTAG 
CAAAAAATGG ATACCAAAAG CACTATIWGT CACCCAAGCT AAOTXIAATA GCIX«^ 
TAGGAGAAAT GCAGC?mTC CTCTACACTA AGlTCrcCAA CTCTKMAA GCCTCCAAAA 
ACAAATGrrrA GGGGAAAAAA AOGCAGCTOG TTATGAAAAG ATA3ATCICA TnCATIAAA 
AAATCAATGT CAAOGCIOTT AATAGAATCC TmAICITC AGGACAGAOG CAATCCXCTA 



AACAAACACC AGCTCAAGAG CCTCTGATGC CAACXTTAGAG GOTACCCAAA CACAAAdTA 
GCATAGAOGT AAGAATCTCT ATGTCmTC GTOGAOGCAA AGCCATITOG nX3OTACITC 
ACAGGAACAT ePTTCTACCA AGTCITCATC ATATGOIATC TCCCACGAGT CTCCA£?ITOr 
TO3CACCACr GTGTCATAGC TGAGAATAOG CTGAAAGCOT ACnriTCATC CTCGAAACCT 
AlTTACAArr GOCAGCTTGAT GTCCCTGCTG CXrACTTAAAA AAGGCTTCGG OXTTOGCATAG 



GCAGAMAGGC CnX^rOTPCCC CKXJrcCCGA TTCTOGGCIC GAGGCX:AATr NCCTTAT 



(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 602 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
AOGACTACAG TGTTATACCX: TCCAATOTT GCAQGTOGGC AIX3GAACACT GCTTOTATCA 60
CTCroroCAC CCTATAAATC CATATATCCA CAAAAACACA CATCCA3CCA TCAACATATA 120
CATGGTTTOG GATGAQCAGG TCAATAGTTr TGAGAGGGAG T TiXynu^Vr I TriTlTlVr 180
CATTATACTC TTAAAITCrP GTCAGrTTATC AAACAAACAA ACAGAAAAAT TCTITOGAAA 240
AACCTTGCAT AaXTTTTPC TATCAAGTCC TITAAAATAT AGACTAAATA CACACA3CCT 300
GOCAGTrrrr tcitacagtg acagtatcct tacctgccat ttaatattag cxTOTTArrr 360
TTCTCACGTA TATTTACCTG TGACTTGTAT TOTrTATTTA AACAGGAAAA AAAACA3TCA 420
AAAAAAGAAA AATTAACTGr AGCGCTTCAT TATACTA3TA TATrATTATr ATTATRJIGA 480
CATrTTGGAA TACTGTGGAA GinTATCTC TTOCATATAC TITATAOGGA AGTATTACGC 540
CITAAAAATA CGAAAATAAA TTTTACAAGG TTOOGGmT GGTOCTGGAA AGACTAAATT 600
GA 602

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1749 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:
AGTCACTGAC TTGGAGCCGC TOOGGGGAAG TCCOGCCCAG ACAGGCGC5TO GGTOGGAATC 60
CCTCACTTCA GTITCAAGAG GGTOOQGATC CAAAGGQGTT AAAACGAGOG AACCCOGATC 120
CCCGACCACA CTTOCOGCCT CCCTAAAAOG CACACCCCGC TAGCCAITOGG CAGCCGCGAC 180
CACCTGTTCA AAGTGCTGGT OGTGGGGGAC GCCGCAGTCG GCAAGA0C7IC GCIGGIX3CAG 240
GATTAarcCC AGGACAGCTT CAGCAAACAC TACAAGICCA 0GC?IGGGA£7r GGATTrTCCT 300
CTGAAGGTTC TCCAGTGGTC TGACTACGAG ATAGTQCGGC TICAGCTOIG GGATATTOCA 360
C3QGCAGGAGC GCTTCACCTC TATGACA06A TTGTATTATC GGGA3GCCTC TOCCKJICTr 420
ATTATGOTTG ACGTTACCAA TGCCACTACC TTCAGCAACA GCCAGAGCJIG GAAACAGGAC 480
CTAGACAGCA AGCTCACACT ACCCAATQGA GAGCCGGTCC CCTCCCTCCT CTO3GCCAAC 540
AAGTGTGATC TGTCCCCnG GGCAGTGAGC CGGGACCAGA TIGACCGCnT CACTAAAGAG 600
AACGcnrrcA caggttggac AGAAACATCA GTCAAGGAGA ACAAAAATAT TAATGAGGCT 660

ATGAGAGTCC TCATTQAAAA GASXSATGAGA AATTCCACAG AAGATA3CAT GTCTITOrcC 720 

A0CX:AAGC5GG ACTACATCAA TCTACAAACC AAGTCCTCCA QCTOGrrcCTC CTOCTACTAG 780 

TGTTTGQCTT ATTTTCCATC CCACTTCTGG GAGGTCCTTT AAGTCTCTTC C Cmwrm 840 

CCCACCTGAC CATTTTATTA ACSTACATTTG AATTCTCTCC TGACTACTCT CXACTAAOGA 900 

GGQCCXMTC TCACTTAGAA AAGACACCTG GAACCCATGT GCATTTCTOC ATCTCCTOGA 960 

TTAGCCTTTC ACATGTTGCT (SCTCACATT AGTGCXZAGTT AGTCCCTTCG GTXJEAAGATC 1020 

TTCTCATCAG CCCTCAATTT GTGATCCGGA ATTTTGTCAG AAGGATTAGA AA3CAGCACC 1080 

TOCXnTTTAG AGATCATAAT TCTCACCTAC TTCTGAGCTT ATTTTTCCAT TTCATATTCA 1140 

TTGATATCAT GACTTCCAAT TGAGAGGAAA ATOAGATCAA ATGTCAnTC CCAAATITCT 1200 

TOTAGGCCXT TCTITCAGAT TCITTCTGTC TTGGAATGTA AACATCTGAT TCTOGAA3GC 1260 

AGAAQGAGGG CSTCTCSGQCAT CTGTC3GA3Tr TTGGCTACTA GAAGTCSTCCC AGAAGTCACT 1320 

GTATTTITGA AACTTCTAAC GTCATAA3TA AGTTTCTCTr GTCTTOGCAT GAAGAAIACJT 1380 

CAAGTTTnT GGCXX^GGCAT GGrTGGCTCAT GCCKGTAATC CCAGCACTTC GGGAGGCCAA 1440 

GQCAGC3COGA TCACATGAGG CCAGGAATTC GAGACCAACC TGGTCAGCAT GGCAAAACCC 1500 

CXTTCTCTACT AAAAGTACAA AAATTAOCCA GGCGTGATQG CACX3TCTCTC TAATCCCAGC 1560 

TACTCTQGAG ACTGAGGTOG GAGAATOGCT TGAGACK3GG AGGCA£3AGGT TOCAC51X3AAC 1620 

CGAGATCATG CX^ACCGCACT TCAGCCTGGG TCACAC5AGAA GGACTCCGTC TCAAAAAAAA 1680 

AAAAAAAAAA AAAACTCGAG GGGGQGOCCG GTACCCAAAT CGCCSTGATA GTCATCGTAW 1740 

ACAATCNAA 1749

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

AAAGAGAIGG GCTCTTTATT TTCTCGAAAA ACCAATTIGG AGTTACTCAT TTTrcCATAA 60 

CATTAAATTT CTTACAGTCA ACTACATATT GTCCATAAGT GdTCATCAG GACTCAJTCGC 120 

CCTCCTGTCT ACTQGCTCCA AATAGACCAT GTCAGCTTCA CCCCCTQGCT TTCTOTCTAT 180 

GQGTGGCCTG TQGTATATGG AAAAGTAGCA GGGTGGTCAG GGTGGGAGAC ACAAGATOIT 240 
rrrATAGTCT AGAOCCrrrA AAAAACCCAG CAGAATGTAA TPCAGTATIT GnTTATrcGC
TCTTTriTGA CAGATTOITG AAATTAAATC AATTGAAAfiG CSAAACTCAGA CTACTAQGAC 
GTTTATTAAA AGGAAAAAAA TGTPCTTGCAA TGTOCTCTAA TCACAAGAGG AGAAAAaSWiC 
TTGTTTCXTr GATCTCTCAG AGGTCACAGT AACCTOGGCC GAGCTOrrAT TATTrATTAT 
ATAATAGTAG TAGGAAGTTA ATAACTOGTr CTCTUIXJITC CAAOCACAAT ATTACAACPr 
CTTITCAACC GTAAATATCA GAATGAATCC TCTTCCCAGG GGASTCAACA GAAGCTEAAT 
GTTTACAACTr GTTTGAATTT GTCATCTGAA ATAACACAAA ATTAAAAACA TCATITCTCT 
AATITTCCAA CTAGAGGAAG AGAAACTTGT GGAAAAGTTX: ITlTrrmt: ITlTlTmT 
CITAAAGAAG GGCAGCCAAG GTAGTAACCT AAAAATAGTG CCCAGQCA1A TCAGAGTTOT 
CCTACX5AGGT TAAflGAACAC ACTGTrCCAC TGrTATGGCTT IGGCXXTCAG TCGCXMQGA 
GGTCAACTTG ACCCraXAT GTTQGTTTGA CITACTAAGA CACAGGAATC AlTOnTTCC 
TTGACCAGGG TCTCACACCC TQGAGGAATG TTAAGTAAGA GAAAGAACCT CTTTCCTCAA 
TATTGACATC TAAAAGACCA AAGTAATTTT TCTGAACTTC TCCAArTCTC AGAACTCTCC 
AAGGAATTTA CAGTGATTTT AGTOCTTGrrC AGCATTTTTC CAICAGGACT TrCATACATT 
TGACTCnTA GITCACAGGT TCCCATTGAT TGTCAGCAAG ATATTTAICT CTTTAGCCCT 
TGGGGATCCA GCTGAGAcirA ATCTCTTCCA rmTTTACC CCHCTAITOTA CAGATATCAT 
TrcTTGriCTA TGCCATGACT TGAAAAAGOT TGGGAAGCTC TTTAGCAATA TCAGCTAAAA 
GGATATGAAA TCACAGGrTGA TAGCAGTTGT CATTCAGTAA TOPCCTACAA GCAGCACXXX: 
AAAGGAAATA TAGTCCTAAT CTTTACTATC CACTTCTAAA ITOVATOIGA ATTICATftCA 
TGTTAITAGT TGTTTTCITr ATAA1T1TAT AAAAATTATT CATCGGGACT TTAACTTCCA 
CTTCCATGCT ATOGGATGTG 1TCGGCTCCA TGCAAGAACT TOGAAGAAAA ACAGGCAGGA 
ATGCAITTGC ATAATGACCX: AGATCATCAT OTrCTGCAAC TCaGAATTAT ArTTCATCAT 
TGCTTCTAGA AGTCTGCAAT TCTPTACTTT TCTTTGCTGC ATTATTATCT AGCTTCCCATC 
ACTQGA'EAAT GTGGAGTGAC TAGAGAAGTC AYATATCACT GTAAGCTACA GTTAGGGGTA 
ACACTTTAGA GGTTTATTAT TTTTAAAAAA CTTTTCTTCA ACTCCTOQQC CAACATOGCTT 
GAAACCCCGT CTTCTTACTr AAAAATACCC AAAATTAGGC CAGGGGCCTG GATQGCTX3GG 
GTCCCTCTTA ATCTTCAGCT ACTTTOGGGGA GGGCTTGAAG CCAQQGAGGA ACTOCCCTCG 
ANCCOCQGQG NQQGCCAGNA GGTTO3CCAG TTGACT 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1753 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

TCTTTTTAAA ATAGACATTT GTGGGGCTCA CACAATATAT GAAATAGTAC CCTCTAAAAA 60 

AGAGAAAAAA AAAATCAGGC GGTCAAACTT AGAGCAACAT TCSTCTTATTA AAGCATAGTT 120 

TATTTCACTA GAAAAAATTT AATATCAAGG ACTATTACAT ACTTCATTAC TAGGAAGTTC 180 

TTTTTAAAAT GACACTTAAA ACAATCACTG AAAACTTGAT CCACATCACA CCCTCfrTTAT 240 

TTTCCTTAAA CATCTTQGAA GCXTAAGCTT CTGAGAATCA TCSTQQCAAGrr GTGATGGGCA 300 

GTAAAATACC AGAGAAGATG TTTAGTAGCA ATTAAAGGCT GTTTGCACCT TTTVAGGfiCCA 360 

GCTGOGCTC3T AGTGATTCCT GGGGCCAGAG TQGCATTATG TTTTTACAAA ATAATGACAT 420 

ATGTCACATG TTTGCATGTT TGTTTQCTTG TTGAATTTTT GAACAGOCAG TTGACCAATC 480 

ATAGAAAGTTA TTACTTTCTT TCATATCSGTT •rTTGGTTCAC TQQCTTAAGA GGTTTCTCAG 540 

AATATCTATG GCCACAGCAG CATACCAGTT TCCATCCTAA TAGGAATGAA ATTAATTTTC 600 

TATCTACTGA TAACflGAATC TC3QGTCACAT GAAAAAAAAT CATTTTATCC GTCTTTTAAG 660 

TATATGTTTA AAATAATAAT TTATGTGTCT GCATATTOCA GAACAGCTCT GAGAGCAACA 720 

GTrrCCCATT AACTCTTTCT GACCAATAGT QCTQGCACCG TTGCTTCCTC TTTQGGAAGA 780 

GGAAAGGGTG TGTGAACATG GCTAACAATC TTCAAATACC CAAATTGTGA TAGCATAAAT 840 

AAAGTATTTA TTTTATGCCT CAGTATATTA TTATTTAATT TTTEAGGTAA TQCCTATCTC 900 

1TOGTCTATT AAGGAAAGAA GCAATCAGTA GAGAATTCAG GATAGTTTTG TTTAAAITCT 960 

TGCAGATTAC ATGTTTTTAC ACTQGCCTGC TATTGAQGAA AGGTATTCTT CVATACAACT 1020 

TGTTTTAACC TTTGAGAACA TTGACAGAAA TTATGCAATG GTTTGTTGAG ATACGGACTT 1080 

GATGGTGCTG ITTAATCAGT TTGCTTCCAA AGTGGCCTAC TCAftGAGGCC CTAAGACTQG 1140 

TAGAAATTAA AAQGATTTCA AAAACTTTCT ATTCCTTTCT TAAACCTACC AGCAAACTAG 1200 

GATTGTGATA GCAATGAATG GTATGATGAA GAAACTTTGA CCAAATTTGT TTTTTTGTTC 1260 

TTGTTGnTGT TTTGAATTTG AAATCATTCT TATTCCCTTT AAGAATGmT ATGTATGAGT 1320 

C7PGAAGATGC TAGOGAACCT ATGCTCAGAT ATTCATCGTA AGTCTCCCTT CACCTGTTAC 1380 

AGAGITTCAG ATCGGTCACT GATAGTATGT ATTTCnTAG TAAGAATGTG TTAAAATTAC 1440 

AATGATCTTT TAAAAAGATG ATGCAGTTCT GTATTTATTG TGCTGTGTCT GGTCCTAAGT 1500 
GGAGOCAATT AAACAAGrTTT CftTATCTATT TITOCAGTGT TCAATCTCAC ACACICTACT 1560

TTGAAAMTT CCTTCCATCC TC5AATAACGA ATAGAAGAGG CCATATATAT TCCCTCCTTA 1620 

TOCTTCAGAT TTCACTACCT TTATGTTAAA AGTTGTGTAT AAnTCTTAAA ATCTOIGAAA 1680

GAATAAAAAG TQGATTTAAA TEAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1740 
AAAAAAAAGG OG6 1753

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1220 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:
GCGGAAGTTA CTGCAGCCGC GC3K3TICTGC TGrrOGGGAAG GGAGAAGGAT TTGTAAACCC 60
CGGAGOGAGG TTCTOTTAC COGfiGGOCGC TGCICTGCGG AGACCCCCGG GTCAAGCCAC 120
CGTCATCATG TCTGACCAGG AGGCAAAACC TICAACTCAG GACTO3GGGG ATAAGAAGGA 180
AGGrPGAATAT ATTAAACTCA AAGTCATPGG ACAGGATAGC AGTCAGATTC ACrTCAAftGT 240
GAAAATGACA ACACATCTCA AGAAACTCAA AGAATCATAC TCTCAAAGAC AGGCnCTTCC 300
AATGAA1TCA CTCAGGTTTC TCTTTQAGGG TCAGAGAATT GCTCATAATC ATACTCCAAA 360
AGAACTGGGA ATGGAGGAAG AAGATGTGAT TGAAGTITAT CAGGAACAAA OQGGGGGTCA 420
TTCAACflGTT TAGATATPCT T iT m - mT TTTCTTTTCC CTCAATCCTT •mTATinT 480
AAAAATACTT CTTTTCTAAT (nGGTCTTCA AAACGGAATT GAAAACTCGC ACCCCATCTC 540
1TPGAAACAT CTOGTAATTT GAATTCTAGT QCTCArEATT CATTAnCTT TOnTrCATT

ITACGIXntX: ACAGAGAGGT CACCrr^ ^20
AAATAAGATC GACCAATGCA AC?rGTTCATA ATGACTTTCC AA1TOGCCCT GATOTTCTAG 780
CATGTOAITA CTTCACTCCT GGACTOIGAC TTTCAGTGGG A6MGGAACT TTITCAGAGA 840
ACTGAACICT GGAAAAATGA CCTTrCCTTA ACTTGAAGCr ACmTAAAA TTTCAGGGIC 900
TGGACCAAAA GAAGftGGAAT ATCAGGTTGA AGnCAAGATC ACAGATAAGG TCAGACTAAT 960
GACTAACTCC AAAGATGGCT TCACTGAAGA AAAGGCAirT TAAGATmr TAAAAATCTT 1020
GTCAGAAGAT CCCAGAAAAG TTCTAATrrT CATTAGCAAT TAATAAAGCT ATACATOAG 1080
AAATGAATAC AACAGAACAC TGCTCTTTIT GATTPTAm CTACnTITC GCCTCGGATA 1140
TQCXjmTAA flSGGACATPG ■TCTCTAOCSG CTTCKITBAA Al»AACAATft -mCTAAAAA 1200
TCAWAAAAAA AAAAAAAAAA 1220

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1049 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
TOC30GOCTGC AGACACACXA TCTACTCAGC GTGQGTCACC TCTOTOAACA TCACTOACTC 60 



CAAGCCTCCC TCAATITCTG CnCCAGOCCA TCAQGGACCC ACAGCXXETC GC3AGGATCCT 
CJCCSCaTCTro GCCAATOCSGG AAATCCSTCCA CXaCCAOSAC CCXXXSAGTCIA GCSflCCACTAC 
CCAGOCaCCA AGAJSGTADCA TTCCTOGftCA GAGCTTCTO: AATAGGGGCC AIGGTCCTCC
aXaOGGGGT CCTQGCCCCC QOCAG(»GCa QGa«^ 

COOCnCAAT GACCTCAACC QGCAGCTGCT GAACAOCGGC TTTCCGCACT GGCA-raXXK 
CAACCATGCT GTGGBGCXX3G TGAOCTOCaT OCTGCTCCTC TTCCTCCTCA TCATCCITCG 
TtSTTOGTOGC CTCCTCCTOG TTGQCCITGT CTACCTOGTC TCCCftOCTCA GICAGCGGTO 
ACCTCTGAGG GCTGATAGGG GTQGGTTICT TGAGflGGGAC TTCCTCQGCC ITOGIGKaG
WSCAGGCATA TTTQGflOGGG ATCTQGTGGT QCCTTGAAGG TATCATCAGA GAQGGGACCA
CAGGTOTOTG TITCCCCrrT GTOTEAAGCG TGAGGCAGAG QGAGAOCTTA GTCCCAGCAT
TTCCC3VAAGT GTGQGrTGGCT COGITOGTTC CXXSAGATACT TTTAGGTCGr ATOGGGCCTO
CACTAAGTGG CACAAAATCA GAQCAAGAAA GCX3ATOCCCT TCCCAAITCT CTCAATOCTT
ITATOCCGAG AAGATCTCAG CTQQATCCrA AOOCTrcOG ATCCXTOTOG AAGACATOCC
GACGTCTCCT CTGCTTAGGG AGCAC5GACTT GGGCTTAGQG CftGGTOGAAA AAATTCCAGA 

cirmTAQC ACTGrmTG rrrrAATOGT ATATnrrAT tcgctacpit ArrorrrAGG 

ACAAGTQGTA GTGOCATTCT ATrTATTGTG ACCTTTTCAA TAAATAGATT TAACTAAAAA 
AAAAAAAAAA AAAACTCGfiG GOOQOGCXr 




(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1776 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

AAAGAGGATG TC3IAGCTAGA GGTCCCCGAT CXXrKXTTCGG ATCGQAAGCA CAAGGCTCAG 60 

GGACTQGATT GTAAAGGCAC TAAGTOGrTTC TGCGC5TGAGA A3X»GACATC GCX3C3ACCTCT 120

ACSCTTCACAT CCTCTTTCCT TGCAGSTCTG GACATCCTGA OCCCAACTCC CCCACACTCA 180 

GTC3CAGTGAT GAGTtSCGGAA GTGAAGGrPQA CAGCX3CAGAA CCAGGAGCAA TTTCTOCTCC 240 

TACSCCAAGTC GGCCAAGGGG GCAGCGCTC5G CCACACTCAT CCATCAGGTC CTX3GAGGCCC 300 

CTC3C3TCTCTA CGTGTITOGA GAACTGCTGG ACATGOCCAA TC3TEAGAGAG CTOOCTtSAGA 360 

C3TGACTTPGC CTCTACCTTC CGQCTC3CTCA CAGTCTTTGC TTATCGGACA TACGCTCACT 420

ACTTAGCPGA AGCCCGGAAT CTTCCTCCAC TAACAGAGGG TCAGAAGAAT AAGCTTCGAC 480 

ACCTCTCAGT TG?rCACCCTG GCTGCTAAAG TAAAGTGTAT CCCATATGCA GnCTTOCTCG 540 

AGGCTCTTGC CCTGCGTAAT GTGCGGCAGC TGGAAGACCT TC3TGATTCAG GCTOICTATG 600 

CTGACGTC3CT TCGTQGCTCC CTGGACCAGC GCAACCAGCG GCTCGAGGTT GACTACAGCA 660 

T0GGC3CGGGA CATCCAGOGC CAGGACCTCA GTOCCATTGC CCCSAACCCTK AANAAAAACC 720

ATTAAAGTTA CGAOQGCAGC AGCAGCCGCA GCCACATCTC AGGACCCTCA GCAACACCTC 780 

ACTGAGCTGA GQGAACCAGC TCCTC3GCACC AACCAGCGCC ASCCAGCAAG AAAGCCTCAA 840 

AGGGCAAGGG GCTCCGAGC3G ANCGCCAAGA TTTQGTCCAA GTCGAATreA AAGRACTOTC 900 

GTITCCTCCC TGGGGATGTG GGGTCCCAGC TGCCTGCCTG OCTCTTAGGA GTCCTCAGAG 960 

AGCCTTCTCST GCCCCTGGCC AGCTGATAAT CCTAGGTTCA TGACCCTTCA CCTCCCCTAA 1020

CCCCAAACAT AGATCACACC TTCTCTAQC3G AGGACXCAAA TGTAGGTCAT G lTlTll T rm 1080 

GTEACTTTCTG TrTTTTGTGA CTrCATGTGT TCCATTGCTC CCCGCTGCCA TCSCTCTCTCC 1140 

CTPGTTTCCT TAAfiAGCTCA GCATCTGTOC CTGTTCATTA CATGTCATIG AGTAGGOGGG 1200 

TAGCCCTGAT GGQQGT0C5CT CICTCTGGAG CATAACCCAC AGGCGTITIT TCTGCCACCC 1260 

CATCCCTGCA TGCCTGATCC CCAGTTCCTA TACCCTACCC CTGACCTATT GAGCAGCCTC 1320

TQAAGftGCCA TAGC3GCCCCC ACCTTTACTC ACACCCTGAG AATTCTOGGA QCCACTCTOC 1380 

CATGCCAGGA GTCACTGGAC ATGTTCATCC TAGAATCCTG TCACACTACA GTCATTTCTT 1440 

TTCCTCTCTC TGCSCCCTTGG CTPCCTOQGAA TGCTOCTGCT TCAACCCCAG AGCCTAAGAA 1500 

TQGCAGCCGT TTCTTAACAT GTTGAGAGAT GATTCTTTCT TC3GCCCTCGC CATCICGGGA 1560 

AGCITCATGG CAATCCTGGA AGGCTTTAAT CTCCTTTTGT GAGTTTGGTC QCSGAAGGGAA 1620
GQC5TATATAG ATTCTATTAA AAAAAAAAAG GTA1MATC5C AmEATCTAT ATATAATATC 1680
ACGCAGAAAT AAATCTATGA GAAATCIATC TACAAAMS<AA AAAAAAAAAA AAAAAAAAAA 1740
AGGAATTOGA TOTCAAGCTT ATCGATACCG TCNACC 1776

(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 443 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:
ACAGATAAAT AAATAAATAA TAAATTAAAT TAAATAAAAA ATCTCAGCTA ATCTCAATAA 60 
ATTGAGAGAT TTCACATGAA AGCCAGGATT TCTGGCTTCC CAGGAACAGT CAGAAGAGCT 120 
AGCTAGCAAC ACTGGTCTGC TTQGCTACCT TCTTTGGAAC AACAT6AAAT CTAGCTCCCT 180

TrrrmTTT tttttqgcxx: acttcatcca ttcacatgac ctgcctogcc tctocagcta 240 

AGTGAGTATG CAACAAAAAT GTAGCACAGG •rrTTGTOGCT GAACTACGIG GnTCAGGTC 300 

CAGCTCTGCC ACTTGCTAGC ATQACCTCGT GCCGAAITCC NSCACGAACT TiTriTmT 360 

'mTrCAGTG CrcCAGTCCC CCTATTQGAG AATCX^TGCCC CCCCCTOQGA CAGAASCTTC 420 

ACCCTGGCCC CGCGANTCCC TGA 443



(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2888 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
TTAATGTTGT CAATAACCAC CAGGCCAAAC AGAAITEATA TCACCTOGAT GAAGATCATC 60 
ATGGTATAGC TrCCGTTCCT ACTAAACAGA TGAAGTrTGC AGCCTCAGGC GNCTITCTCC 120 
ACCACATGGC TGGGCTAAGC ACTTCCAAGC TTTOCATGTC CAAGGCCCTC CCTCTCACCA 180 
AAGTQGTrCA GAATGATGCA TACACAGCTC CTGCTCTCCC TTCCTCTArr OGAACAAAAG 240
CCTTGACCAA CATGTrCCCGG ACACTQGTGA ACAAGQAAGA ACCCCCCAAA GAGCTCCCAG 300
CTGCTGAGCC TGTTCTCAGC OSVTKXSAAG GCACCAAGAT GACT^ 360






CTCGAGTCAC TGAGCaOGAC ATTGTTGAGC TTTTCTGTGT GTOIGGGGCC CTCAAGCGAG 420
CTCGACTGGT CCATCCTOGG GTAGOGGAGG TOGTGrrTTGT G?UiAAAGGAC GATGCCATCA 480
CCGCATATAA GAAGTACAAC AACOGGTGTC TQGACGGGCA GCOGATGAAG TOCAACCTTC 540
ACATGAATGG GAATGTTATC ACCTCAGACC AGCXXMCCT GCTQCQGCTG AGTGftCAGCX: 600
CATCAATGAA AAAQGAGAGC GAGCTGCCTC GCAQGGTGAA CTCTGCCTCC TCCTCCAACC 660
CCCCOGCYGA AGTGGACCCT GACACCATCC TGAAGGCACT CTTCAACJrCC TCAGGGQCCT 720
CTKTGACCAC GCAGCCCACA GAATTCAAAA TCAAQCTTTC AGCAGGQGAG TGftGGCAGCC 780
AGAAGTQGGG GCAGAGGAGG GTGGCTCTGT TTCCCCAAQG CAAAGCTTAT GACCAATGGG 840
CCATCGGACT GGAGAOCCCT GATTGTGGGA AGGGTTGCCA GGGATAAAGA GCTTCCTCAC 900
TQGATQGGAC COGCCTTTCT O W ntTltJ lT CTGCCCrGTG CTCTTCTCTC TACGrTTAACG 960
TTTCCTGTAG TATGTTTCTT CATCTCArcG CXMGGTAGG ClTClXJ l' inT TCAGTCTGTG 1020
ccrccccGHG cxrrcAGCccc aagctgattt CITATCTGGA AATGGTACAC TGAATTCTCT 1080
GGGTGGCTTT CTICTGCSCCC CATQGGATGC AGCGTQGGGG CTGTCTGAAG GACCCTGCTT 1140
TTTCCftGQGG COGAGGGGCT GCCTTTCCTT TGTGrrGTATT AAC3CTTTTCA AACAATQGAG 1200
GGGATGGAGA GCCCTGGTGT CCTGACGGGA GCCAGGTCGG CCTGAGftGCT GTGCXXXTCC 1260
TCrGTCITGT CAGTQGAC3GT GCCTGGC?IOG GGAGCAGGTC TCAGGCXTTCT TGrPCCTCTCC 1320
CCAGTGGCTC CAGGCCTCAC TAGTCGCAAG GGCAQGATGA GGCTGCACXX3 CTOGGAAGAG 1380
TCTATCTAAG YTCTTGGCTT GGAGTCCCGT GTCGTCTCCR CCCAGAGGAA GrPCTCCAGA 1440
GTTCACCTTT CCCTTTTCCT TGAGTTGTGC TGAATGCCXX: ACCCCAGCTC ' iV lTl lXXriT 1500
CTGGGrPGTCT TTGCTGGGAG GGQGCTOICT ICTGAGCXXn* CCCGGTTCTC ACCTCGOCTG 1560
GCACTTAACC ACACCCTQGT TTTarGTAGC CX3CCAGCTCT CTTCTGGrTPG GGCCTTTCAA 1620
AGGCTCAGCC TCCCATTGTG CAGTGCTTC3G GrriTOGAGCT TATTTGAATG GAAGAGCJTCA 1680
GlTrGTTCCT GGCTCTCCAT TTCTQGCCTC AC3TTGTCTAC AGGACAGrrC3G TCAGC3GATQC 1740
CTQGAGGCAT ATATCCAGCT GCCACCAAGG GGCACTGTTT GTTCCCACTr ATGrPGAGnPGA 1800
CCXXMCCAT CCATGACCAG AGGATTATTT TCCTOCCTTG GCAGAQGAGG AGGAGTCAAG 1860
C3GAGCAGGGC AGCTCTACCA GGCAAGGTCT TTCCCCAGCA TAGGCGCAGA CAGTTGGGAC 1920
GAAACnCAG AGCCCAGGCA GTCCX7PGAAT GACCAGGCCA GICTTGTCAC TGAGrrGGTPCC 1980
CCTGCTGGTT GGGACTTGEAAG AGAATCCAGG CTGGCAGAGC TGGAGCCAGT TCQGGAGCAC 2040
GGTTCTOGQA GCTCTGCAAA ATCAGTAGCA AGTC3CTGGAA AAGGCACATG CCGAAGATAC 2100
TCAAGAGCTC CCAAGATTTG CTTGAGGCTA GCXXMrTGAA RAAAACCAGA GACTCATGriT 2160
TCCAGGGGTC AGTCTGrrCAG GCAGGAAGGA CTCAGGATTT GAACCCAGCT TCAGTOTOCA
GQCTCTGAQG CTGCCXWOGA CGGGAAAGTC CAAGGAAGGG GCCTQGTOGT GCTCCACTTO 
CAGTTCTTTA AAGAATOCTG CTTnTATTC TCCTAACCCT TTCAAGTOQG TOCftSACTTC 
TCGTTAGCAG CTGGAAGACA TTCCTCCCAC ACTTTTCCCT TCCTOGCCCA AGAGAGCATC 
CAGAAGGCftG TAGGACCTGG TITTTCAGC5T ACTOGGAGCC GC5GGGCPCAC TGCTTOCACT 
GTOCTTAGC3G TAGGGATGGT AAAIATCCTC 0CTGCATC3QC TTTATCCTCC CTCICATCCC 
AAAGCAGGTA TCTTCTOGTT GTCACAGAGT TTCATTGAGT OCAGCTOCAG CCAOyroGCC 
ATCTGGAGCT GGTGCTATAG GTOACCATCT GGTACAITCA GQGGACCTCT TTOCCTCCTC 
CACTCTATAA GCAGTCATCT TQGGAGACCG GGAGGAGAAG GTCGTOGGCT AOTCCTCTOT 
CCrcCTCCflC TTCCCAIGCC TCTATGTTAC CCATCTGTCT CTCCTG?K5CA GAAQGAGAGG 
AAGGGGCATT AAGAGATGAA GCaGTCATTAT GTATTACTTA TCCATTTCK3 AATAAACATT 
TCTTTAITCCT AAAAAAAAAA AAAAAAAACT CGAGGGGGGG CCCGGWACCC AMATCGCCSK 
AAAGrrOAG 

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1851 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
CACTAGTATA ATTTATAATT ATAACCTATT CTGATTTCTr TTCAAATATT AGGTCTCCTA 
CHTGCCTATG AAGGTTTGCC ACTTCATCTT GCACTGTTCC CCAAACTITC GACTCAGCTA 
TGCCAGACTC AGTCTGCTAT GTCAAAAAAC TGCATCAAGC TTTTGTCTCA AGATCCIOTT 
TTCGCAGAAT ATATTAAATG TATCCTAATG GATGAAAGAA CmTITAAA CAACAACATT 
GTCTACACGT TCATGACACA TTTCCTTCTA AAGGTrCAAA GTCAAGTCTT TTCTCAAGCA 
AACIOTQCCA ATTTGATCAG CACTCTTATT ACAAACITCA TAAGCCAGTA TCAGAACCTA 
CAGTCTGATT TCTCCAACCG AGTTGAAATT TCCAAAGCAA GTGCTTCITT AAATOGGGAC 
CTGAGGGCAC TOGCTITCCT CCTGTCAGTA CACACTCCCA AACAGTTAAA CCCAGCTCTA 
ATTCCAACTC TGCAAGAGCT TTTAAGCAAA TGCAGGACTT GTCTGCAACA GAGAAACTCA 
CTCCAAGAGC AAGAAGCCAA AGAAAGAAAA ACTAAAGATG ATCAAGGAGC AACTCCCATT 






AAAAGQCX3GC GnCTTAGCAG TGATGAGGAG CACACTCTAG ACAGCTOCAT CAGTCACATC 
AAAACAGAAA CCAGGGAC3GT CCTGACCXXIA ACX3AGCACTT CTGACAATCA GACCAGAGAC 720

TCCTCAATTA TTGATCCAGG AACTCAGCAA GATCTTCCTT CCCCTCAAAA TACTTCTGrr 780



AAAGAATACC GAATGGAAGTT TCCATCTTCG TTTTCAGAAG ACATCTCAAA TATCAQC3TCA 840 

CAGCATC3CAG AAGAACAGTC CAACAATOC?r AGATATGACG ATTGTAAAGA AITTAAAGAC 900 

CTCCACPGrrr CCAAGGATTC TACCCTAGCC GAGGAAGAAT CTCACTTCCC rrcTACTPCT 960 

ATCTCTGCAG TTCTGTCTGA CrTAGCTGAC TTGAGAAGCT GTCA'TOGCCA AGCTITOCCC 1020 



TCCCAGGACC CTGAGGTTQC TTTATCTCTC AGTTC7PQGCC ATTOCAGAGG ACTCTTTA(3r 1080

CATATQCAGC AACATGACAT TTTAGATACC CTGTGTAGGA CCATTOAATC TACAATOCAT 1140 

GTCGTCACAA GGATATCTGG CAAAGGAAAC CAAGCTQCTT CITCACATTA GCTTCTAGCAT 1200 

GTCTACTTTT AAGTCCCTCA CXXXX:AACCC CCATQCTGTT TGTATAAGTT TTGCTTATrr 1260 

GTmTCTGC TTCAGTriCT CCAGTGCTCT CTGCTTGAAT GGCAAGATAG ATTTATAGGC 1320 

ITAATTCTTG GTCAGGCAGA ACTCCAGATG AAAAAAACTT GCATCTTCAG TATAePTCCT 1380 

AAAGGGCAAT CAGATAATGG ATATGTTTrA TGTAATTAAG AGTTCACTTT ACTGCCTTTC 1440 

ArrTAATATG GCTGTCTQGG AAGAACAGGG TTGCCTAGCC CTGTACAATC TAATTTAAAC 1500 

TTACAGCATT TrrACOCTGT ATGATATGGT GTCCTCTGTC CCAGmTOT ACCFTAXAGA 1560 

GGCAGATTGC CTCCGATCGC TGTOGTTCTT ATTATCAAAA TTAAGTTTAC TICTATACGG 1620 

AACAACCACA AGAAATTTGA TTCTGTAAAG AATCCTCTTT AQCTGTQGCC ttSGCACTATA 1680

TAAATQGTGC TTTATTTAAC AGAATACCTG TGGAGGAAAT AAAGCACACT TCATOPAAAA 1740 
ATAATTGOTT TAmTTATT GACAITGACTG ATTGArrGCT ATTCTCTCCA CTTAATTAAA 1800



CTGATTGTGA TGACTTWWAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 1851 



(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3542 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

TCCAATQCTG ATGAGCCTTCT TOGCTGGCAG GCCAGCTCCT TGCCTQCIGA TCACCTTK3C 60 

ACAGAAAATG CCATCATQCT GAAACGATTC AATAGGTATC CGCTGATCAT TCACCCCTCT 120 

GGACAGGCCA CAGAATTCAT TATGAATGAA TATAAGGWTC C3TAAGA3CAC ACGGACCAGC 180 
TTCCTOGATC ACGCCTTCAG AAAGAACTTA GAGAGrTOCAC TCAGATTCQG TAAOXCCIT
CTOCrPOCAGG ATOTOGAAAG CTAOGATCCA GTTPPGAACC OGGTOCIGAA CCCHGAAGTO 
OOQCGAACAG GGGQGAGAGT GCTGATCACT CTOQC3QGAOC AGGACATAGA CCTOTCGCCA 
TCGTTICTCA TCrrcCTOrC CACCCGGGAT CCAACTCTCG AGITCCCACC AGAaXTTCTOT 
TCCCGGGTTA CTITrGTAAA CTTCACAGTT ACCCGTAGCA GTITACAAAG CCACSKnCTA 
AATCAAGTAC rCAAAGCAGA AAGACCTGAT CTTQGACGAGA AACGAOCTCA TCTTCTTAAA 

errCAAGGGG aatttcagct costttqcgt cagctggaaa aatoctact acaagctctc 

AACX3AGGTOA AAGGGCGCAT TTTOGATGAC GACAOSATCA TAAC3CACTCT GQAGAACCTC 

aagagagagg ctccagaggt caccaggaaa gttcaggaga cqgacaitct catccaggag 

C3TGGAGACCG TOTCCCAGCA GTCAOCTCCCG CTCTCCACCG CCTOCAGCAG CATCTACTIC 

accatggagt ccctcaagca gatacactpc ttctaccagt actccctcca GrmrccTC 

GACATTTATC ACAACGTCCT ATACX5AGAAC CCGAACCTGA AGGGTCTCAC CGACCACACA 
CAGCXXXTTCT CCATTATAAC AAAGGACCTC TrCCAGGTGG CGTITAACCG A£?roGCTCGA 
GGCATGCTGC ATCAGGACCA CATTACCTTT GCCATGCTCC TGGCAAGAAT CAAACTCAAG 
GCXaCCGTGG GGGAGCXCAC CTAOGATGCA GAATTCCAGC ACTTCITCAG AGGAAAJTGAG 
ATTGICCTQA GTGCTGGCTC CACCCCCAGG ATCCAGGGCC TGACTCTCGA GCAGGCGGAG 
GOGGTGGTGA OGCTGAGCTG CXTTCCCGCG TTTAAGGACT TGATTGCAAA GGTICAQGCA 
GACGAGCAAT TTGGCATCTG GOTGGACAGC AGCTCCCCGG AGCAGACTOT GCXCTACCTC 
TOGAGTGAAG AAACACOTGC AACACCCATT GGCCAGQCX7V TCCACCGCCT QCTCCIGATC 
CAGGCTTTCC GGCXXGATCG CCICTTGGCC ATGGCCCACA TGTTTOnTC AACAAACCTT 
GGGGAGTCTT TCATGTCCAT CATQGAGCAG CXrCCICGACC TCACCCACAT TCTQGSCACA 
GRC3GTCAAGC CXIAACACTCC TOTCTTAATC TGCTCTC3TGC CTOGrTTATCA TCCCACJTOQA 
CATGrrCGAGG ACX^TTGCAGC CGAGCAGAAC ACGCAGATCA CTTCAA3TOC AATCGGCTCT 
GCAGAAGGCT TTAACCAAGC AGATAAGGCA ATAAACACCG CTGrCAAAGTC GGGCAGCTTOG 
GTGATGCTGA AGAATOTGCA TCTQGCCCCA GGGTQGCTGA TGCAGCTOGA GAAGAACTK5 
CMTCCCTGC AGCOGCATGC CTGCTTCCGA CTCTTCCTCA CCAIGGAGAT CAACCCCAAG 
GTGCCTCTGA ATCTGCTCCG TGCGGGCCGC ATCTTTCTGT TCGAGCCACC GCCAGGGKTC 
AAGGCCAACA TQCTGAGGAC GTICAGCAGC A3TCCCGTCT CAOSGATATG CAAC?ICTCCC 
AACGAGOGTG CCCGCTTGTA CTTCCTGCTC GCCTQGTTTC ATOCGATCAT CCAAGAACOC 
TTACGATACG CACCACTGGG GTGGTCAAAG AAGTATGAAT TTGGAGAGTC 1GACCTCCGG 
TCANYTTGCG ATAOSGTCGA CACGrTGGCTCS GATGACftCGG CCAAGQCSCAG GCAGAACATC 2040
TCACOGGATA AGATCCOGTC GTCTOCACTA AAGACCTTAA TOGCCXraffTC CATTimOCSC 2100 
GGGCGCGTOS ACAACXSAGOT TGAOCAOCGT CTQCTCAACA CCTTCCTOGA GCGOTOTTC 2160 
ACAACX^GGA CHTTOGACAG TGAGrTTTAAG CTOGCATGCA AGC3TCGACGG ACATAAAGAC 2220 
AITCAAATOC CAGATCC3CAT GCAGC3CX3AGA GGAGrTTTGTG CAGTGGGTOG AGTTCCTCCC 2280 
CGACflCCCAG ACGCCCTCCT GGCTOQC3CCT GCCCAACAAC GCCX3AGAGAG TCCTCCTTAC 2340 
CACACAGC3GT GTOQACATGA TCACTAAAAT GCTGAAGATG CAGAOCTTOG AGGAIXSAGGA 2400 
CGACCTGXXC TACGCAGAGA CTCA^ 2460 
CCCTGCCTCG ATGCGC3ACAC TGCACACCAC OGCGTCCAAC TGCXTTCCACC TCATCCXXXZA 2520 
GACGCTQAGC CACCTCAAGC GCACCGTOGA GAATATCAAG GATCCTTICT TCAGC3TICrr 2580 

TGAGAGAGAA GTGAAGATOG GCXXZAAAGCT GCTTCAGGAC CSTTCCSCCAGG ACCTTOCAGA 2640 

TCTPCarCCAG GTGTGCGAAG GAAAGAAGAA GCAGACCAAC TACTTOOGCA CGCTCAOXIAA 2700 

CGftGCTAGTG AAAGGGATCT TOCXTO3GAG CTGGTCOCAC TACACGGTCC CTOCCGGCAT 2760 

GACCGTCATC CAGnGGGTOT CCGACTTCAG CGAGAQGATC AAACAGCTOC AGAACATCTC 2820 

ACTGOCAGCT GCATCPQGTC GOGCCAAGGA GCTAAAGAAC ATCCACGICT GCCTCGCTTOG 2880 

CCTCnTOGTG CCTQAGGCGT ACATCACTGC CACCAGGCAG TATOnXSOCCC PJSCCCAACfiG 2940 

CTCGTCCCTG GAGGftGCTCT GCCTGGAAGT CAACGTCACC ACCTCACAGG GCGCCACCCT 3000 

TGACGCTTGC AGCTTCGGAG TCACGGGTTT GAAACITCAA GGGGCCACC?r GCAACAACAA 3060 

CAAGCTCTCA CTGTCCAATC CCATCTCAAC OGCCCTTCCC CTCACGCAGC TCCXSCTCGGT 3120 

CAAGCAGACA AACACCGAGA AGAAGGCCAG TGTOGTAACC TTACCTOPCT ACCTCAACPT 3180 

CACCOSTGCA GACCTCATCT TCAiXGTGGA CTTCQAAATT GCTACAAAGG AGGATCXTTCG 3240 

CAGCrrCTAC GAGCGGOGTC TCGCAGTCTT GTGCACAGAG TAAACITITC TAGCTGCCCC 3300 

TTTCTGTAAT AGPGAAAGTT GGTATTTAAC AnTATTCAT TlTrAAAATA TITGGAAGC?r 3360 

CTGAGCTICT GAAAAGAAAG TGGTO3GTCT GAGGTITOGAG GAAGCTCAAT GQAATCTCAC 3420 

CSGTTGGQAGrP GCTGGAAATT GGAAGGATAC CAQGAQCTAT TTGGGAAGGC CAATCGCffTCJ 3480 

CXTOrTTTGA GGAAATAAAA CACTAAGCAT GAAAAAAAAA AAAAAACTTA CAANCCNCAA 3540 

^ 3542 






(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 883 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
AGGTGATTTT AATGATAQGT GTCATATATA QGACGOATAA TCICTTTACA TTCTOTlCTr 
CTCGATOCAC TCACAAGOGG GTAACTAGGT GACAASAAAA CAAAGATCTT ATTCAAAAGA 
GGTCTTACAG CAAOXAACG TCTCATCTTC CCATAGTAAA GATCACGGCG CCITCAGCTA 
AGCTACAGGC AACACCACTT CaSCCSTTTCT CTTOCGCCCT GCJTCCAAGAT GC3CGGATCAA 240 
GOCACGOGAC GTGTTGTGrPC TGAGATCCCG GTC5CTGAAGA CTAACGCCC3G ACCCCGAGAT 300 
CXyrGAGTTGT GGCSTGCAGCG ACTQAAGGAG GAATATCAGT CXXTTTATCCG CTATCTOGAG 360 
AACAACAAGA ATOCTGACAA CGATTGCTTTC CGACTGGAGT CCAACAAGGA AGGAACTCGG 420 
TQGTTTQGAA AATQCTQCTrA TATCCATGAC CTCCTGAAAT ATGAGTTTCA CATCGACTTT 480
GACATTCCTA TCACATATCC TACTACTOCC CCAGAAATTC CAGTTCCTCA GCTOGATOGA 540
AAGACAGCAA AGATOTACAG GGGTOGCAAA ATATQCCTGA CGGATCATIT CAAACCTITC 600
TGGGGCCAGG AAJPGTGCCCA AAlTrGGACT AGCTCATCTC ATQGCTCTGG GGCTCQGTCC 660
ATGGSTGGCA GTQGAAAItx: CTGATCTQAT TCAGAAGGGC GTCATCCAAC ACAAAGfiGAA 720
ATGCAACCAA TGAAGAATCA AGCCACTGAG GCAGGGCAGA GQGACCTTTG ATAGtSCTACG 780
ATACTASmr CCTGrroCATC acacttaact catctaactc ttccccggac ancctccact 840



CTAGTTCTTA CTAAGrTANTG CAGTAGCATT NTOGQGAAGA ACA 883 



(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1541 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
C3GCACGAGGT GGCCTCTACC CTOQGCTCAT CTGGCTACAC AGGGACTCTA AACGCITCCA 60
GATTCCCTGG AAACATGCCA CCCGGCATAG CCCTCAACAA GAAGAGGAAA ATACCAaTTT 120 
TAAGGCCTGG GCTGTAGAGA CAGQGAAGTA OCAGGAAGGG GTGGATCACC CTCACCCAOC 180 
TAAATGGAAG GCCCAGCTGC GCTGTGCTCT CAATAAGAGC AGAGAATTCA ACCTCATOTA 240 
TGATQGCACC AAGGAGGTGC CCATGAACCC AGTGAAGATA TATCAAGTGT GTCACATCCC 
TCAGCCOCAG GQCTCGATCA TTAACCCAGG ATCCACAGGG TCTOCTCCCT GOGA3GAGAA 360



GGATAATGAT GTOGATGAAG AAGATOAGGA AGATGAGCTG GATCftCTrcGC AGCACCA2CT 420 

TCCCATCCAG GACACCTPCC CCTTCCPGAA CATCAATOGT TCTCCCA3T3G CGCCAGCCAG 480 

TGTGGGCAAT TGCAGTCrTCG GCAACTOCAG CCCGGAGCXIA CTXJTOGCCCA AAACTCAACC 540 

CCTCGAGATG GAAGTAOCCX: AGC3CACCTAT ACAGCCCTPC TATAGCTCIC CAGAACTOK; 600 

GATCAQCTCT CTCOCAATGA CTGACCTGGA CATCAAGPTr CAGTACCGTO GGAAGGACTA 660 

CX3GGCAGACC ATCACCGTGA GCAACCCTCA GCXXTOCXXA CTCTTCTATO GQGACCTOGG 720 

TCXCATQCXrr GACCAGGAGG ACXTTCTrrGG TCCCGTCAC3I CTCGAGCAGG TCAAATTCCC 780 

AGGTCCTGAG CATATTACCA ATGAGAAGCA GAAGCTOTTC ACTAGCAAGC TCCTOGACOT 840 

CATOGACAGA GGACIQATCC TQGAGGTCAG CGGTCATGCC ATTTATOCCA TCAGOCTOIG 900 

CCAGTOCAAG GTrGTACTOGT CTGGGCCATG TGCCCCATCA CTTGTTOCTC OCAACCEGAT 960 

TGAGAGACAA AAGAAQGTCA AGCTATTTTG TCTQGAAACA TTCCTTAGCG ATCTCATTOC 1020 

CCACCAGAAA GGACAGATAG AGAAGCAGCC ACCGrTTTGAG ATCTACITAT GCrTTTOGGGA 1080 

AGAATQGCCA GATGGGAAAC CATTGGAAAG GAAACTCATC TTOGTTCAGG TCATTCCAGT 1140 

AGnxSGCTCGG ATQATCTACG AGATGTaTTC TGGTGATTTC ACAOGATCCT TTGATAGIGG 1200 

CAGTGTCCGC CTGCAGATCT CAACCCCAGA CATCAAGGAT AACATCGTTC CTCAGCTCAA 1260 

GCAGCTGTAC CGCATCXOTC AAACCCAGGA GAGCTTGGCAG CCCA3QCAGC CCACCCCCAG 1320 

CATPGCAACTG CCCCCTQCCC TGCCTCCCCA GrTAATTGTCA AIGCCATCTT CTTCCTrCTC 1380 

TTTTTTATAA TAITCTACAT ATGGATTTTT TTATPGrETTA GATTTAACCA GCTTTTAAAT 1440 

CTCIGTTTTC TGTCACAGTG TrAGAAGTTT GTGATTCTCC AAATATOCTT AGATTTAAAG 1500 

CTGATTTAAT TTAItSGAAAA AAAAAAAAAA AAAAAAAAAA A 1541 

(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 732 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

AGAAAATGAA TGITOGAAGG TQCCTGCCGA GGCGGGACAG AGTGTITCCr CGCGCTGGAG 60 

AAGGCTCTGC TCAGCOCTGA GAGTCCCTTC CTGCCCCACC GATACTOGCA CTTTAAAAAG 120
GAAGCTGACC GCACftGrPGTC CAGACGAATT GGCCCCCAGA A3MGGGGAG TTCTClt.Vm
CCCTPCTGTG TCTC30GTGAC CTCACCCAGC CTAGGAGGGA GGTOCATTCA QGGTAGATTT 240
GCCTCTCATT CAAAGTTCTG GGOCTTTQQG OQGAAAACAG CCAGCTITOG OSCTCITOGG
GAGACTCCTC CAGACCAGGA ACCCCAGAAG GAGACAGAGC CTCOCACATC CTCCCACGCC
AGGCCCTGGG CCAGGGIGAT TGGACTGAGA ATTK3GCCAC AACCAAAHTC ATCCTOGCTC
GAACCAGAGG CCAGAAAGCXr TGGCCinVlC CCCATOTGGG AGCCCTCTCC TCAGCCCTCT 480
TOTCCCCTTC AGCTCAGTGA ATTCCCACCA GGTOCCXaCA QCTCCTOGAC TTCAAASTCT 540
ATATATTGAG AGAGTTQGAG AGTATATCAG AGATATTTTT GCSAAAGGACT TCGICTATOC 600
AATOTCAGTr TGGAA2X7ITC TTGAAAffTTT AATGnTTTA TTAGGAGATT TAAAGAAAAT 660
AAAGGTCTAC AATATCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 720
AAAAAAAAAA AA 732



(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 629 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

TTAAGGAATT CGGOKXSATC CCGGCAAGTA ACATGACTAA AAAGAAGCGG GAGAATCTOG 60 

GCX3T0GCTCT AGAGATCGAT QGGCTAGAGG AGAAGCTGTC CCAGTGTOGG AGAGACCTGG 120 

AGGCCGTGAA CTCCAGACTC CACAGCCQGG AGCTGAGCCC AGAGGCCAGG AQGTCCCTGG 180 

AGAAGGAGAA AAACAGCCTA ATGAACAAAG OCTCCAACTA OGAGAAGGAA CTGAACTTIC 240 

TTCGGCAAGA GAACOQGAAG AACATGCTGC TCTCTGPQGC CATCTTTATC CTCCTCACGC 300 

TCGTCTATGC CTACTGGACC ATGTGAGCCT OGCACTTCCC CACAACCAGC ACAGGCTTCC 360 

ACTTQGCCCC TTQGTCAGGA TCAAGCAGGC ACTTCAAQCC TCAATAGGAC CAAQGTOCTC 420 

GGGTCTTTCCC CTCCCAACCT AGTCTTCAAG CATGGCTTCC TGGCGGCCCA GGCCTTGCCT 480 

CCCTGGCCTG CTOGGGGGrTT COGGGTCTCC AGAAGGACAT GGTGCTOGrTC CCTCCCTTAG 540 

CCCAAGQGAG AGGCAATAAA GAACACAAAG CTGAAAAAAA AAAAAAAAAA AACTCGTAGG 600 

GGGGGCCCGT ACCCAATCGC CCTNTCGTC 629 

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1751 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

CTC3CTAQCCG GCCGGOGCAG GCTC3CCGAGC GQCTTCAGCGC GCAGGCCAGG CCAAAGCCCT 60 

GGTACCOGOG CGGTOCGGGC CTCAGTCTGC GQCCATQGGG GCGTCCGCGC GGCTCCTOCG 120 

AGOGGTCATC ATQGGGGCCX: CGQC3CT0GQG CAAGGGCACC GTCTCGTCGC GCATCACTAC 180 

ACACTTCGAG CTGAAGCACC TCTCCAGCGG GC3ACCTQCTC OQQGACAACA TCCTCCGQGG 240 

CACAGAAATT GGCGTGITAG CCAAGGCTTT CATTGACCAA GGGAAACTCA 1CCCAGA3GA 300 

TGTCATGACT CGGCTGQCCC TTCATGAGCT GAAAAATCTC AOCCAGTATA GCTGQCICTT 360 

GGATQGrmr CCAAGGACAC TTCCACAGGC AGAAGCCCTA GATAGAGCIT AJTCAGATCGA 420 

CACAGrrGATT AACCTGAATG TQCCdTrGA GGTCATTAAA CAAOGCCTTA GTGCTCGCTC 480 

GATTCATCCC GCCAGTGGCC GAGTCTATAA CATTGAATTC AACCCTCCCA AAACTGTOGG 540 

CATTGATGAC CTGACTQGGG AGCCTCTCAT TCAGCGTGAG GATGATAAAC CAGAGACGC?r 600 

TATCAAGAGA CTAAAGGCTT ATGAAGACCA AACAAAGCCA GTCCTOGAAT ATTACCAGAA 660 

AAAAGGGGTG CTQGAAACAT TCTOCGGAAC AGAAACCAAC AAGATTTGGC CCTATGTATA 720 

TGCrrrCCTA CAAACTAAAG TTCCACAAAG AAGCCAGAAA QCTK:AGTTA CTCCA-KIAGG 780 

AGAAATGTGT GTAACTATTA ATAGTAAGAT GGGCAAACCT CCTAGTCXTTT GCATTTAGAA 840 

GCTCCTTTTC CTAAGACTTC TAGTATGTAT GAATTCTTTG AAAATTATAT TACTTTTATT 900 

TCTACTGATT TTATTITCGA TACTTAAGGAT GTGCCAAATG ATTCQGATAC TAAGATOCAT 960 

CGTTTGAAAT CATCTACTGT GTTGrrATGCA GTTATCCTCA AAAACATCAG CGATGTCTGA 1020 

ACCTTTAAAA CATCTGTTAG AGCAAAATTA AAAGAGCATT TQGTAGTAAT CTAACTTTTr 1080 

GrrrCAGTTAA TAAGTGGTTG ATAAAGTTTC CATATTTTTC TOGAAAAGTT AAAAAAACTT 1140 

ACATGTCATT TGGAGAAAAT AOGTAATCAG AAATTTGTGC ATAGATTGAT GCCAAAAAAG 1200 

ACATTTCCAG CATTGTGGAA CATGGTGAGA CACTATATAA AATTCCAGAA AGAAAGCAAC 1260 

TGGATTTACA GATTTATTGr GAGACACAAA TTCACTOCTG CCTITACACT AAGAAATGTA 1320 

TATGTTAACC ATATATGCTG TATTTATTTT C?rCC?rTAAGC ATACTTTCAG TTTACTCAGA 1380 

ATTTTCAAOT TQCTATAAAG ATGTATCAAT TAGCATATAG AAAAATATTA CTTTAAGATC 1440 

ACTTGTTTCC TTTGAAAATA CCTGTGTACT GAGQGTTATG ATTTGTGICA AAAATKSACA 1500 

TAAGTGCTTr TACAAGCAOC AAWJITCAAT GAATmCAA CAAAATCTAA TTAAAGTCTA 1560 

TGrmrCAGT TATGACTCAG GTTAAGAAAT GTOTTTTAGG MXnTWmXSC TOCTmrrcT 1620 

TTITGATCCA AAroPGTCAT CTOOCCTGAT AAATAACAAG TOA3NCTACC ATCTCCOCCG 1680 

CCAATAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGQQGG GCCXX3GTACC CAATTCTCCG 1740 

NAATAGGNAG T 1751 

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

GGCACGAGAT TATGTATTAA AATCTTTTTG AATTC5TGAAA TA3TAGAATA TIOTTACTAT 60 

TOSACCCAAC TCAAAATCTC CATQGGAAAA TACCTGTCGA TACCCACflCT AITCTTCAAA 120 

ATAATCAGAT GCAGTATCAC AGCTC3TC5TCA GACTCTAGTA CCAOITOGGC AATCAAGCSCA 180 

CAGCTAAAAA TTGAAAACAA AGATCTOGAC AACAAAACAG CCAAAGGTCG GGCHCAAGAA 240 

GCTCTGACCn' C3TACCTAGCT GTAGAATGCT ATGCACACGT GCCAGGnCTA CSTOTOCATAT 300 

CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT CACCTCTQGT TGACCATC3AG CTCTOICTAA 360 

GCAGGAAGTG AAGGCTAAGG CAGATTTAAC3 CTCTGAAAGC ATTCCACAAC ATACACACAA 420 
ATCGTGCAAA GCATTAAGGA AATCTTGTTA CrOCTAAGTG TTGCTOACCC AGGAACAACT 
CCTACTCAGC TQGACTTAAA AATAAAAA 508 

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 245 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:
TACATAGAGC AAAGAGAAAT TTCCAGAATT TCTARAATTC TGGAAAGAGA ATTPTCCTGA 60 
GATTGCftGAT TTGCTTGTGT CCTCAQGTGA TGATGAGGGC TGTTTTCCCC TCTICTCCTr 120 
TCCTCACACT CATGCTTCCT CTCCTAGACT GTCTGGTrGG CATCAICAIG TCCTACCTAG 180 

GCATTrCTTT CACTGATACA AGGAAAACTC CAGQC?rrAAA AAAAAAAAAA AAAAAAAAAA 
NCNCG 



(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 361 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

ATGTTCCTrCA TGAQGATGCA CTTGTGCITC TOCAAGTATT GCTGCAQCTT CATAGTGACT 60 

CCCACCAGCA CCAGCAATAC AGCTAGCTAC CTC3TOGCCTT GGATCTCAGC CAGCATGGCT 120 

GC3GAGAGGGA GCftGCTGGGC ATGTACCCTA AATGCTGTTA CXZAGGGAAGG ACTCCCAGAG 180 

TGAAGACAAG TAGGGACTTC CTOCAGAGGT GCSTACATGrTG CTCTCTCTAT CCATACTTrr 240 

TTrTTTTTTT TTTTGAGATA GAGTTTCACC CTTGTTGCCC TOGCPGGAGT GCAATOOTGC 300 

GATCTCAGCT CACTCCAACC TCTCTCOCTC CCGGGTTCAA GTGATTCTCC TCCCTCAGCC 

T 

(2) INFORMATION FOR SEQ ID NO: 72:





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

AGGATCACAC AATAGAGAAC ACTGTAGTAA CATTTCGGrTC TQCTCACAAG ACCCAGAACA 60 
TTGATCAGIT TTTGrTGTTG GTTTATTATT TTTCTGTTAA AAAATICTGA AAAGTrTCTT 120 
TTAGCTAGAT GATATTTTAA TAGCTGOGAG TGCTTTGGAA CTATAAAGAT GTCACTACTr 180 
AACACACATA CCrTATGTIT TOi ' lTm r m ' TGirTTACAC TCAGTATAAA TCAGGAGAAG 240 
TTAGCCAACC ATCTAGCATT TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGAOCT 

TCCCATAACA GCAACAAAAC AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTCT 

AAGCACCTGC TTAACnTGT GTOCCAAATA TTTAGTGrPGT ATATATATAT- ATATATATAC 420 
ACACACACAC ACATATATAT TCAACAAATA AAGCAAAATA TAACATQCAT ITCACATTIT 480 
GTCTTTCCCT GTTACGATTr TAATAGCAGA ACTOIATGAC AAGTrTAGCJr GATCXTTAGCA 
TATGTTAAAT TCAAATTAAT GTAAAACAGA TTAACAACAA CAAAGAAACT GTCTAITIGA 
GTCAAGTCAT GCTTOrEftTT ATAATAACTT GGCTTOGGIT A3XXATCAAA VXACMTTTA 
TACTGOTATC TGATTGTTTA TAATAAAGAA TACTGTACTT ATAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GAAAGTCAGA GCTCTCCAAT CCCTCAGCAC CTTTTAGATT TC3CTCCAAAT TAGAAACCTS 
GQGACTATC3T GTTCTGQGCA ATCACAQGTC TC3GAAAAT0G CTCTCXTyGQC TCTTGATACT 
GAGACACSTOG TCATCTTACC AGACATGCAT CTGATTTTAA GCCTCAGQCT AATCCACAAT 
GCTCGGOCAT GCCTATGATT AACAAACAAA AGCAAAATCT GCTTTTATAG TTTAGGAAAC 
CTQGATAGAA CAGTATTTTT CAGCAITCTT GGATAAAGCA GTTCTGCATr TTTAAATTOG 
GACIGCAGAA GTCACTGrTCT ATAGrTTGTGA AATACAAAAA ATGGTATGTT TCATCAGAAA 
AGGAAGCCCG TQCCTGGCAC TTGGAAAGAT ACTGAGCATC ATAACCCTAA TtSAGAAAATC 
TAGGCTCTGT GAATGTTAAC TACAAATCAG GTTAGGAAAG CATATGACAC CCTTTGTCAA 
ACTAAGCTTC ACTAGGAGGA CCTC7PGCTCA TAGAAGAATA TC3CTTTAAAA GTATCAATTT 
TCCACAGTOG ATGATGGAGA AAAGTTCATT TGCACCAGAA TQCTGATAGT CACAATACAC 
AQCCTGACAT ATATAACAAT ACAGTTTTCT GTAAACAGAA GrTTCTTCCTC TTCCAATTCA 
GGAGTCAGTC AGAGCATAAA TATTQCATC3T TTCACTTTAG AAACTGAITC ATTTTAGAAA 
GCAGATCTGG ATTATTTTGC AGQGTAGAAA TGAAGGCTAT TTCTGGCATT CTTGCTCAAA 
AAGTCAATAT ATGTACATTA AGTATAAAAA AGGGTCTCTT TCACCTCTTT TOTTrCGTAG 
CATTGGCTAC ATAACTOGTC CC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4602 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

GCGAGGGC3GC GKGGGGAGCA QOGCXXSAROC CGCCGCXTTCC GCCTCCGCOG CCTAGGACTA 60 

GGGGGTOGGG GACGGACAAG CXXX3C3ATQCC QQGGGAKftCG GAAGAGCCGA GACCCCOQGA 120 

GCAGCAOGAC CAGGAAGQGG GflGAGQOQGC CAAOC3CQCCT CCGGAOGACX: OSCAACAACG 180 

GCCCXXTTGAG GOSGICGCGG CX3GCGCCTGC AGQGACCACT AGCAGCX3GCG TX3CTCAGGGG 240 

AGGTCGQGAC CGAGGCOGGG COSCTGCGRC OQCCGCGCMG CAGCTGTGTC OCGCXGGAGA 300 

AQGCX3GAGTA T0CCCQCXXX3 CGAQGAQCAG CCCCAGCGCC AGGCCTCCCG ACGTCCCCQG 360 

GCAGCAGCCC AQGCCGCGAA GTCCCCC5TCT CCAGITCAGG GCAAGAAGAG TCCGCX3ACTC 420 

CTATGCATAG AAAAACTTAAC AACTGATAAA GATCCCAAGG AAGAAAAAGA GGAAGAAGAC 480 

GATTCTGCCC TCCCTCAGGA AGrTTTCCATT GCTQCATCTA GACCTAGCCG GGGCTGGCGT 540 

AGTAGTAGGA CATCTGTm; TOGCCATCGT GATACAGAGA ACACCCGAAG CTCTOGGTCC 600 

AAGACCGGTT CATTOCAGCT CATTTGCAAG TCAGAAOCAA ATACAGACCA ACTTGATTftT 660 

GATGTTQGAG AAGAGCATCA GTCTCCAGGT GGCATTAGTA GTGAAGAGGA AGAGGAGGAG 720 

GAAGAAGAGA TGrTTAATCAG TGAAGAGGAG ATAOCATTCA AAGATGATCC AAGAGATGAG 780 

ACCTACAAAC CCCACPPAGA AAGGGAAACC CCAAAGCCAC GGAGAAAATC AGGGAAGGTA 840 

AAAGAAGAGA AGGAGAAGAA GGAAATTAAA GTGGAAGTAG AGGTOGAGGT GAAAGAAGAG 900 

GAGAATGAAA TTAGAGAGGA TGAGGAACCT CCAAGGAAGA GAGGAAGAAG ACGAAAAGAT 960 

GACAAAAGTC CACCTTTACC CAAAAGGAGA AAAAAGOCTC CAATCCAGTA TGTCCGTTGrr 1020 

GAGATGGAAG GATGTGGAAC TCTCCTTQCC CATCCPOQCT ATTTGCAGCA CCACATTAAA 1080 

TACCAGCATT TQCTGAAGAA GAAATATGTA TGTCCCCATC CCTCCTGTGG ACGACTCTTC 1140 

AGOCTTCAGA AQCAACTTCT GOGACATGCX: AAACATCATA CAGATCAAAG GGATTATATC 1200 

TOTGAATATT GTCCTCGGGC CTTCAAGAGT TCCCACAATC TQGCAGTCCA (XCGATGATT 1260 

CACACTQGCG AGAAGCATTA CAATGTGAGA TCTOTGGATT TACTTCTCGA CAAAAGGCAT 1320 

CTCTTAATTG GCACATGAAG AAACA!PGATG CAGACTCCTT CTACCAGTTT TCTTGCAATA 1380 

TCTGTGGCAA AAAATTTGAG AAGAAGGACA GOGTAGTCGC ACACAAGGCA AAAAGCCACC 1440 

CTGAGGTC3CT GATTGCAGAA GCTCTGGCTC CCAATGCAGG CXSCXTTCATC ACCAGCACAG 1500 

ATATCTTQG6 CACTAACCCA GAGTCCCTGA CX3CAGCCTTC AGATQGTCAG GGTCTTOCTC 1560 

TTCTPCCTGA GCCCITOQGA AACTCAACCT CTGGAGAGTG CCTACTGTTA GAAGCTGAAG 1620 

GGATGrrCAAA GTCATACTGC AGTQGGACGG AAOSGGTGAG CCTGA3X3GCT GATOGGAAGA 1680 

TCTTrorGGG AAGCX3GCAGC AGTGGAGGCA CTGAAGGGCT GGTTATGAAC TCAGATATAC 1740 

TOOGTOCTAC CACAGAGGTT CTGATTGAAG ATTCAGACTC TGCOGGACCT TACTTGGACAG 1800 

GAAGACTTGG GGCATGGGAC AGCTCAGACT TTCTATTTAA AAGriTAAAAA GGACAAAAAA 1860 

AAAATCTAAA GCATTTAAAA TCTnGTGAAA TAACTGftAGG GCCTGCTCIT TOCATTGnOG 1920 

ATCACflGCAC ACACATACAT ACACCCTCCA CCTCOCCATC CXXTOriir it: CCTCTGrnXX: 1980 

TCCCXnTATA AAATTGATGT TGTCTrTACC AGAAAGGTAG ACAAAAAAGA AGCAGCAGCA 2040 

GCTCTTAAAG TGAGGGTTAT TCTCATACTC GGTTCCAGCC ATCAGCAGAC TTCCTCCTCA 2100 

TCGGCAGATC CCCCTTPCCA ACCTGTAACT CTGATGTOCT CTQGATCAGC TTTTAACITr 2160 

TAATCATATA TTACTGTCTT CTAAATCCCT TCTOCTOCTC TACTOCTQCX: CTATGGrTTCT 2220 

GQCTCCTACC COCTGOC3QCA CACTTATCTT CAAATACTAT AGAATTCTAA TCTCTCAAAT 2280 

CATAGCTCrrC CAGTGGCTTT TAAAGAAAGC TGGTCCTCAG CACTAACAAA ATCACTACAA 2340 

TAGCCTAGTG CTTTTTTOGA AGCCTTTrTA QGGAAGAATG TTAGGTTCAT GGTAACTAGT 2400 

ATCCTCTITC AGATTTTTAC AGTCITQAAA CTTTAAGAATT TTGAGAGGOT GAGGAGGGTT 2460 

GTTCAGAATC TAAATTACAG ATAGATGATT GTTTCTTGTG AATTTGTTTC TmiX. - lTrr 2520 

TTTTTGTCCC TACCA3TTCC TTACAITrCC CTTOOOQCCC ATCTCTQGCT CCTTCCTTTT 2580 

TGTTTCTTQC TTTGCTTTAT CAGTTCATTC CAGCTCCCTG TTAGTGAAGG ACACTCCTOT 2640 

TAGTGAAGGA ACAAAGTCTA TGAG?rcCTAA AATTTTAAGT CAAAGAAAAC TGCTCTCTTT 2700 

CCCCTTTACT AACACTTCTC AAGAGGAAAA ACTTCAATAG CCAAAGITAA TAATCCTATA 2760 

TAATAATTGC TTTGGCTTTC ACCTAAAATT CTGQGCATCA CAATTTCCTT GQGATAGAGG 2820 

TTGTGTTGGG GAATftGATTG CTTATTQCTG TTCACTQGAG AGAAAAGGTCA GTGTTTTTCT 2880 

ACAAGGTCAT ACCGCCAGAA GCCCCAAATC CTATTTTGQC TCATCTTCAG GTAAAGAGTA 2940 

ATTCCTTATCC TGTGTCCCTC AGAAGCTAGA ATOGAAGGCT TACCCTATTC ATTCITIATT 3000 

C3TCAGAAATG CATGATGGCT CTTOGAAAGA ATGACXSTTTT GCTCGAAAAA AAAAAAARAA 3060 

CMGTnCTGT TTCACAAACA TGGCTTATCA ATTTTTTCAA A6AATTCTTT TTTCCCAAAA 3120 

AGAGGAGTAA CAAAATGTCA TTTCTGAAAG AGGCTTACaT TATACCAACT AGTGrTCAGCA 3180 

TTTGGGATGC CAGGGAACAG AGAGTPGAGAC ACCTACAATC ACCAGTCTCA AA30:GCTAT 3240 

TGTTrcTrrr cagagtgttg cagatttgcx: atttctocat aatatgggqa tagaaaaigg 3300 

AATAAAGATA GAAQGGATGT AGAASWroCT TTCCTGOCAA C3VT0GlTrGG AGTCGACTTT 3360 

GGTATATTGA CTAGATITCA AAATACAAGA TTGATTAGAT GAATCTACAA AAAAGTTCTC 3420 

CTCCTCTCAG GTCCCTITTA CACTTTTTGA CTAACTAGCA TCTATATTCC ACACTTAGCT 3480 

TTTTTGTCAC ACTTATCCTT TOTCTCCXSTA AATTTCATTT GCAGTOGITA CTCATCAGAT 3540 
ATTTTAGCCA CCTACACAAA AGCAAACTGC ATTTTTAAAA ATCTTTCTCA GATOQGAGAA 3600 
AATGTATTCT CXnTTCCTAT AOCX3CTCTCC CAACAAAAAA ACAACTAGTT AGITCTACTA 3660 
ATTAGAAACT TQCTOTACTr irrCITITCT TTTAGGGGTC AAC5GACCCTC TITATAOCTA 3720 
CCATTTOCCT ACAATAAATT ATTGCAGCAG TITOCAATAC TAAAATATTT TTTATAGACT 3780 

TTATATmr CXTTTTGATA AAQGGATOCT QCATAGTAGA C3TTOGTGTAA TTAAACTA3X: 3840 

TCAGCCGrrrr cccTGcrrrc cxttctgctc catatocctc AirGrrccrrc cagggagck: 3900 

TrrTAATCTT AAAGTTCTAC ATPTCATCCT CTTAGTCAAA TTCTGTTACC TmTAATAA 3960 

CTCTTCCXZAC TOCATATITC CATCTTGAAT TQGTOGTTCT AAATTCTGAA ACTCTACTTC 4020 

AGATACfiGCT ATTTAATATT TCTQC3GAGAT GTOCATCCCT CTTCTritJ iX; GTTCCCCAAG 4080 

GTTGmTGC GTAACTCMA CTCCTTGATA TGCTTCAGAG AATTTAGGCA AACACTOGCr 4140 

ATQGCCGTGG GAGTACTQGG ACTAAAATAA AAATATCGAG GTATAGACTA GCATCCflCAT 4200 

AGAGCACTTG AACCTCCTrT (TIftCCTGTTT GGOGAAAAAG TATAATGAGT GTACTACCAA 4260 

TCTAACTAAG ATTATTATAG TCTQGTTCTT TGAAATACCA TT lTrn v It: CmTOICTr 4320 

TTTCCCACTT TOCAATGTAC TCAAGAAAAT TGAACAAATG TAATCGATCA ATTTAAAATA 4380 

TrrTATTTCT TAAAAGCCTT TITTCXCTGT TGTAATGrTGC AGGACCCTTC 'TCCTITCATC 4440 

GGAGAGACAG GTAGTTACCT GAATATAC3GT TGAAAAGC5TT AOCTAAAAAG AAATTATAAT 4500 

AAAAGGGATA CTrTGCTTTT CAAATCTTTG TTTTCTCTTA TTCTAGGTAA GGCATATTAA 4560 

AAATAAATAT GTAAAGAAGA AAAATAAAAG TTGTCTTCAT GG 4602 

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1255 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

OGCGOCCCGG GCCGGCGGGT TTCTCTAACA AATAAACAGA ACCCGCACTG CCCAGGOGAG 60 

CGTTGCCACT TTCAAAGTGG TCCCCTGGGG GAGCTCAGCC TCATCCTCAT GATGCTGCCA 120 

AOGCGCACTT TTTATTTrTA TTTTATTTTT ATTTrTTTTT TAGGATCCIT TTCGGGCTTC 180 

ACTCTCAGAG CCAGmTTA AGGGACACCA GAGCX3GCAGC CTGCTCTQAT TCTATOGCTT 240 

GGlTOrrACT ATAAGAGTAA TTGCCTAACT TGATTTTTCA TCTCTTTAAC CAAACTIGTC 300 

QCCAAAACaVT ATTTCAOCGT TTCCAAAATT CAGATTCTC3C CTCTOCQGAT AAATATITOC 360 

CAOGAATGAG TAACTCCTGT CACCACTCTG AAGGTroCAGA CAGAAOCTTT TCACACAITC 420 
TTAGCACTGA ACTCCTCICT GATCTAQGAT GATCTGrTTCC CCCTCTOGAT GAACATCCTC 
TQATGATCAA GGCTCCCAGC AGGCTACTTT GAAGGGAACA A'TCAGATCCA AAAGCTCITC 
GGTOTTTATT TAAAATACTA GTGTCACTTr CTGAGTACCC GCCQCTTCAC AGGCTCAGTC 600 
CAGGCCTGTG TGCnTOTAG AGCCAGCTGC TTGCTCACAG CCACAITTCC ATITCCATCA 660 

TTACTGOCTT CACCTOCATA GTCACTCTTT TGATQCTGGG GAAOCAAAAT GGTCATCATA 720 

TATAGACTTT ATOTATAGCC ACAGTTCATC CCCAACCCTA GTCTTCGAAA TCTTAATATT 780 

TGATAAATCT AGAAAATGCA TTCATACAAT TACAGAATTC AAATATTGCA AAAGGATCTO 840 

TGTCrrrCTC COCGAGCTXX CCICTTCCCC TTCATTGAAA ACCACCACQG TOCCATCTCT 900 

TGTGTATGCA GOGCTATQCA CCTQCAGGCA CGTGTGTATG CACTCCCCGC ITOTOTPrAC 960 

ACAAGCTGTG GQGTGTTACG CATGCCTGCT TTTTTCACTT AATAATACAG CTTOGAGAGA 1020 

TPnTOEATC ACATTATAAA TCCCACTCGC TCTTTTTGAT GGCCACATAA TAACTACTOC 1080 

ATAATATGGA TACGCCTTAT T-PGATTTAAC TAGTTCCCTA ATCA3GGACT TTTAAGTICT 1140 
TTCCTTTITT TTTCTmTT GCTACTGCAA ACGATGCTAT AATAAATGTC CTTATCAAAA 1200 

AAAAAAAAAA AAAAAAAAAA AAAAAANCCC NSGGGGGGGG CCCCGGGAAC NCAAT 1255 

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
GGCAOGAGAG AAATGTTrGA TTCTCTTTCC TAOTTTAAGG GATCTTCTCT CTTOTTGATC 60 
TTGAAAACTT ACCTTAGTGA AGATGTGrTTT CAACATGCTG TTGTCCITrA CCTOCATAAT 120 
CACAGCTATG CATCTATICA AAGTGATGAT CTGTGGGATA GnTTTAATCA GGTCACAAAC 180 
CAAACACTAG ATGTAAAGAG AATGATGAAA ACCTGGACCC TQCAGAAAGG ATlTCCTPrA 240 
GTCACTCTTC AAAAGAAAQG AAAGGAACTT rTTATACAAC AAGAGAGATT dnTTAAAT 300 
ATGAAGOCTG AAATTCAGCC TTCAGATACA AOGrTACATGC OCTCTTTCTT ITCATOCCAT 
CTCTTTTGCA CTCTCAGGrTG GAAATATTTT GAACTCTrTT ATAATCATAA GTTCTTOTCA 
AACCEAACAA GATEATOCXT TCCTAAGAAT ACTTAAOCTT OCTACCAAAT TAAAA 

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 465 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:
TTCTCTCTGC TCTTCGACTG CACCGCACTC GCGOGTGACC CTCACTCCCC CTACTCAGCT 
CAGCGC3TGCT GOCATGGCGrT GGCGGCGGCG OGAACCRGCG TCGGGGCTCG CGGCGTOTTC 
GCrCTCXXOT TC3CTCGCCCT QGCCCTGTtSC GrTCCCCQGGG CCCGGGC3CCG GCXTCTCCSAG 
TGGTTCTCGG CCGTC3GTAAA CATOGACTAC GrOGACCCGC AGACCAACCT GACGGnCTC3G 
AGOGTCTCGG AGAGTQGCCG CTTCGGOGAC AGCTCGCCCA AGGAGOGCGC GCAOGGCCIC 
C3TGQGCGTCC CC3TQGGCGCC CQQCGGAGAM CTCGARGGCT KCGCGCCCGA CACGCGCTIC 
TTCGTOCCCG AGCCCGGCGG CCGAC3C3QC3CC C3CGCCCTC5GG TaSCCCTOCT (Xmx^TQGOG 
QCTGCACCTT TCAAGGACAA AGrPGCTQCTPG GCGGCGCNGA ANGAA 



(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1907 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:
ACATtSCAGCC CAACTACAGA ITCTTATGGA ATTCCTCAAG GTTOCAAGAA GAAATAAGAG 
AGAGCAACTG GAACAGATCC AGAAGGAGCT AAGTGTPTTG GAAGAGGATA TTAAGAGACT 
GGAAGAAATG AGTGGCTTAT ACTCTCCTGT CAGrPGAGGAT AGCACAGTCC CTCAATITCA 
AGCTCCTTCT CCATCACACA GTAGTATTAT TGATTCCACA -GAATACAGCC AACCTCCAGG 
TTTCAGTGGC AGTTCTCAGA CAAAGAAACA GCdTOGTAT AATAGCACCSr TAGCATCAAG 
AOQAAAAOGA CTTACTQCTC ATTTTGAAGA CTTGGAOCAG TGTTACnTT CTACAAGGAT 
CnCTCGTATC TCAGATQACA GTCGAACTGC AAGCCAGTTG GATGAATTTC AGGAATGCTT 
GTCCAAGTTT ACTOGATATA ATTCAGTACG ACCTTTA^ ACATTCTCAT ATCCTAGTCA 
TCTCTATAAT GGTrCCACTA TASTCTCTAG TAITCAATTT GACOQGGATT GTGACEATTT 540 

TQ0GA3TOCT GGAGTTACAA AGAAGATTAA AGTCTATGAA TA3GACACTG TCftTCCAGGA 600 

TGCAGTOGAT ATTCATtACC CTGftGAATGA AATGACCTCC AATTCGAAAA TCAGCTCTAT 660 

CACTTGGAGT AGTTACCATA AGAACCTCTT AGCTAGCAGrr GATTATCAAG GCACTCTPAT 720 

TTTATQQGAT GGA3TCACAG GACAGAQC3TC AAAGGTCTAT CAGGAGCA3G AGAAGAQC?IG 780 

TTQGAGTGTT GACTTTAATT TGATGGA3XX: TAAACTCTTG GCTTCAGGTT CTCATCATOC 840 

AAAAGTTGAAG CTGIGGTCTA CXTAATCTAGA CAACTCAGrPG GCAAGCA3TG AGGCAAAGGC 900 

TAATOICTGC TGrrCTTAAAT TCAGCCCTTC TTCCAQATAC CATTTGGCTT TCGGCTOIGC 960 

AGATCACTGT GTCCACTACT ATC5ATCTTCG TAACACTAAA CAGCCAATCA TGGTATOCAA 1020 

AGGACACCGT AAAGCAGTCT CTTATGCAAA GTTTCTGAGT QGTGAGGAAA TrGTCXirmc 1080 

CTCAflCAGAC AGTCAGCTAA AACTGTGGAA TGTAGQGAAA CCPTPJ^TTQCC TACGrrTCCTT 1140 

CAAGQGTCAT ATCAATGAAA AAAACTTTOT AGGCCTQGCT TCCAATQGAG ATTATATAGC 1200 

TTGTQGAAGT GAAAATAACT CTCTCTACCT GTACTA1!AAA GGACTTTCTA AGACTTTCCT 1260 

AACnTTAAG TTTGATACAG TCAAAAGrTGT TCTOGAC3W^ GACOGAAAflG AAGAaXSATAC 1320 

AAATC3AATTT GrrTAGTGCTG TOTGCTGGAG GGCACTACCA GA3X3GGGAC3T CCAATGTCCT 1380 

GATTGCTGCT AACACTCAGG GTACAATTAA GGTQCTASAA TTGCTATGAA GGGTEAACTC 1440 

AAGTCAAATT GTACTTGATC CTGCTGAAAT ACATCTQCAG CTGACAATCA GAGAAGAAAC 1500 

AGAAAATGTC AOXTrGATCTC TCTCXXX3^ GTCATCATOG GTTTrGGATT TGTTTTCAAT 1560 

ATTTTTTTCT TTmTCTTT TCCCT CLTIT ATGACCTTTG QGACATTGGG AATACCCAGC 1620 

CAACTCTCCA CCATCAATCT AACTCCATGG ACATTGCPGC TCTTGGTGGT GTTATCTAAT 1680 

TTTTOTGATA GGGAAACAAA TTOTTTTGAA TAAAAATAAA TAACAAAACA ATAAAAGTTT 1740 

ATTGAGOCAC AGTIGAGCTr GGAAAGTTTT TGTCAAATGC NOCAAGAGAT AACTCnTTT 1800 

ANGAAGTAGC ATATGTCAAC TATAATOTAA CAGTGAATAA TTTGTAAAGT TCGTATTTCC 1860 

CAACXTCPTT QQGAATTACA CATATCAATA TAAACAAAAf ATAAAGT 1907 

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1168 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

CSCTGGGGrrGT CCXXKCSGCC ACCftlXOTCA TCXXTTTACTT GATCAAGCAC ACTCQGA3X2A 60 

COCATGACTG ATGCTTATAA ATTTC3TCAAA GGCAAACXaC CAATTATCTC CCCAAACCTT 120 

AACTTCATGG GCSCAGTTGCT AGAlGrrTCGAG GAACSACCTAA ACAACGCTCT GACACCGAGA 180 

ATCCTTACflC CAAAGCTGAT GQGCGTQGAG ACGGTTGTGT GACAA3X3C?rc TOGA3X3GAAA 240 

GGATTGCrOC TCTCCATTAG GAGACAATGA GGAAGGAGGA TOGATTCTOG Tr m ' mvr 300 

TTCTTTTTTT TTTTGTAGTT GGGAGTAAGT TTCTGAATOG AAACAAACTT CTTITAAACAC 360 

TrrAirnTA acaagtgtaa gaagactata acttttgatg ccattcagat tcacctccca 420 

CAAACTGACA AATTAAGGAG GTTAAAGAAG TAATTTTTTT AAGCCAACAA TAAAAATATA 480 

ATACAACTTG TTTCTCCCCC iTrH-VXTlT AAGCTATTTG TAGAGTITAT GACTAAATAG 540 

TCTGTGCAGG TTCATAGACC GAAGATACTA CACACTTTAA ACCAAITAAA AAGAACCAAA 600 

AGTAAATAGA AAAGACATTG AATCACCAAG GCXTO3GATC AACXHtSGGCT (TTCCACACAG 660 

AAAACAAAAA CCCAACCAAA CCAAGCCTTG TICTGCTCAC TGGTCCAAAG AGAAGATCAG 720 

GGCAGCTTAA GTOGTCTAAG RATCOTTCAG GCATTCTTTA AGGAGAAAAA GGATACCTTT 780 

GATTTTCTGT GTTTCATGCT CTCGATTTTr T nTriTrit: CtTCTCTCGG TTTAAGAGAT 840 

TrmTTGAA ATAGTCAGGA ACTGACCATT ATATGCXOTC ACTOGCTTCT TOTOCAATAA 900 

TATGATGrrrr TAAGTGTCCA AACAAGTTAG AGCTGGCAGC TCAA3X3ATAG ACAAATACTG 960 

CAAATTTGCX: AGCTTQGAGA TAGAAAGGAA TTCAACAATA TATCAAATAC TTTCCTrCCC 1020 

ACCTTTTTCC TITmTTTT TTTTTTCTGA TTTGATTCTG GTTACACnX3C CATAAACCTT 1080 

GTTACAOATC TATATCAGAA TGTAAGAAAA AAAAATTTAT TTAAAAATAT TTTICGCAAA 1140 



AAAAAAANNA AAAAACTCGA GGGQGGCC 

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1285 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
AGAAAATCAC ATCCTAACAA AGAAGTCTGT CTAAGACAGT ACATCTCCTC TTCAACTPGC 60 
ATCTTTCCAC AGGACTITCT GTTrTTAGGG ATGAGACTAT TCTCTCCTTC ATCAAGGAAA 120 
GAGAAATGTT CAGGGTTGTA GGGATGGCAC ACTTATTAGT TCTOCCTCTC TCAAAGCTITC 180 

CTGCAGGACA GTrTQGTCAG AOCTGCAATT CTTAGTOCAT GCHCTAAltSC TTCAGTAICT 240 

crrcTiTCcc ttpcctgtct caggaatcag ctgagaattc ATTccsATrcrr amsccrcrrA 300 

GCXXXTTACT GKSATTTCTT GGTTGCACTT TCATTTGCTr TACHTCTAGA ATCACCTOPT 360 

GACTCCTCAG ACTTCACCTA ACTITGGAAA CTCTCTTTTG GAGC5CTTCTC ATTrCCCCCT 420 

AArrCTOTQC TOCCTGAGCC CTAGAATTTT CCCACCAACG AATTAITCCA GCTAGATCCT 480 

AAGTPGCTGG ATCTAGTTGA TATTTAAACA ATATCTAGTr GATAOTTCTC ATTCAGITOG 540 

ATCCAGAAAC CAGTATXnCT NAAAAACAAC CTCTCATACC TICTQGACCT AATTTTOKTr 600 

QCGrrGrrGTGT GTQOSCGCAT ATGTATATAG ACAGGCACAT CTTTTTTACT TTTCTAAAAG 660 

CTTATGCCTC TTTGC3TATCT ATATCTGTGA AAGTTTTAAT QATCTOCCAT AATCTCTTOG 720 

C3GACCTTTGT CTTCTGTGTA AATGGTACTA GAGAAAACAC CTATATTATC AGTCAATCTA 780 

GrrrGGrrrrrA ttcgacatga aggaaatttc cagataacaa cactaacaaa crcrcccnG 840 

ACTAGGGGGA CAAAGAAAAG CAAAACTGAC CATAAAAAAC AATTACCTX3G TCAGAAGTIG 900 

CATAAACAGA ATTAGGTAGT ATATTGAAGA CAGCATCATT AAACAGTTAT G T lUri t: i l, t: 960 
1TGCAAAAAA CATGTACTGA CTTCCCGTTG AGTAATGCCA AGTTCTmT TTTAITATAA 1020 

AACTTGCXCT TCATTACAT6 irrTCAAAGTG GTGTGGTGGG CCAAAATATT GAAATCATOG 1080 

AACTGACTGA TAAAGCTGTA CAAATAAGCA GTGTGCCTAA CAAGCAACAC AGTAATOITC 1140 

ACATGCTTAA TTCACAAATG CTAATTTCAT TATAAATTGT TTTGCTAAAA TACACTTTGA 1200 

AACTArrnr CTGTATTCCA AGAGCTGAGA TCTTAGATTT TATGTAGTAT TAACTGAAAA 1260 

AATAOSAAAA TAATAAACAT TGAAG 1285 

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1290 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

TCTCCAGCCC CAATTTCTAC GCGCACOQGA AGACGGAGGT CCTCTTTCCT TCCCTAACGC 60 

AGCCATGGCT CGTOGTCCCA AGAAGCATCT GAAGCGGGTG GCAGCTCCAA AGCATTOGAT 120 

GCTGGATAAA TTGACCGGTG TGTTTGCTCC TCGTCCATCC ACCGGTCCCC ACAAC7ITCAG 180 

AGAGTGTCTC CCCCTCATCA TTTTCCTGAG GAACAGACTT AAGTATOCCC TCACAGGAGA 240 

TGAAGTAAAG AAGATTTGCA TGCAGOGGTT CATTAAAATC GATQGCAAGG TCCGAACTCA 300 

TATAACTTAC CCTOCTQGAT TCATOGATGT CATCAGCATT GACAAGftCGG GAGAGAMTT 360 

COGTCTGATC TATGACACCA AGGGrPCGCTT TGCTGTACAT (X3TATTACAC CTCftGQAGGC 420 

CAAC7EACAAG TTGrPQCAAAG TGAGAAAGAT CTTTC3TOGGC ACAAAAGGAA TCCCTCAICT 480 

GGrrGACTCAT GATOCCCGCA CXZATCCGCTA CCCOGATCCC CTCATCAAGG TCAATCATAC 540 

CATTCAGATT GATTTAGAGA CTQGCAftGAT TACTGATTTC ATCAAGTTCC ATTCACCCAG 600 

CCAGCypQGTC TCGTCACCTC AGAGC3CTCCG CaGACTCCTG CCCAGGCCAG GACTCAGGCA 660 

AGCCTCAAGG CACTTCTAGG ACCTGCCTCT TCTCACCAAG ATGAACTCAC TOGTITCTTC 720 

GCAGCTACTG CTTTTCCTCT GTQCCACCCA CTTTGQQGAG CCATTAGAAA AGC7K3GCCTC 780 

TGrrOQGGAAT TCTAGACCCA CAGGCCAGCA GCTAGAATCC CTGGGCCTCC TOGCCCCSGG 840 

GGAGCAGAGC CTOCCGPGCA CCGAGAGGAA GCCAGCTGCT ACTQCX:AGGC TCAGCCGTCG 900 

GC3GGACCTCG CTGTCCCCGC CCCCCGAGAG CTCCX^GGAGC CXXX^GCAGC CGGGCCTCTC 960 

OGCCCCCCAC AGCCGCCAGA TCCCOGCACC CCAGQGOGCG GTGCTGGTCC AGCGQGAGAA 1020 

GGACCTGCCG AACTACAACT GGAACTCCTT CGGCCTCCGC TTCGGCAAGC GQGAGGCGGC 1080 

ACCAGGGAAC CACQGCAGAA GOGCTGGGCG GGGCTGAGGG CGCAGGTCCG GGGCAGTGAA 1140 

CTTCAGACXX! CAAAGGAGTC AGAGCATGCG GGGCGGGGGC GGGGGGCGGG GACGTAGGGC 1200 

TAAGGGAGGG GGCGCTGGAG CTTCCAACCC GAGQCAATAA AAGAAATGTT GCGTAACTCA 1260 

AAAAAAAAAA AAAAAAAANC TCGGGGGGGG 1290 

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 684 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

TTTATICTAT TCTOTAACTA TAGAACTTCT ATTTOATTCT TmTCGACT TCCTAAGTrG 60 

TCTTTWATOG TTTTWAGnTC CATGCTGAAG TTTTCAOTAT TGACTTATCC CCITCAACAT 120 

GAGTTGTTTT ATAGACTCTR ATQATTCAAA AATCTTACAT CTTITCGTAG TCTCTTTCAT 180 

"TrGTYCACTG TITCTGTTGA TTCTWACTCA TGGTATTTTA ATTCTTCGTr WriTITmC 240 

TCnVIAGAWA CATTCTTTGA AAAATAATTT GGAGGAATAT TIGATTCTTA TCAACAAGGC 300 

ATTACTCACC AGAGAAGATT TTTTTGTTYT ACCARGTGCC TARGAATCCT AACAGTCTGG 360 



GftMCACATAG AMCACCAGGT GATGAGACAA TCCTGGGART CCTOmTAC TTTOGSCCAT 420 



CmTCTCCC AACCCTGTOG GAAIARTCAT YCATATCCTA RCTOCAGGCT ARAAGGTOCT 
TTATCAGAGC CCAACTTOGA GGGCTCTGGG CnTAGCTAC TOTCACCCCA TCATAACTCA 
GCrrCATQGA TTGATTCTCT 'mTATCTTT CAGATTrTCT TTTAAAAATC TTTOITmT 
TTTTTCTTCC GAAAGATTCC CCCAACATTA CCAITCCCCA C L T l U aj r i - m AATTTTrTTC 660 
GCTCTCATTT TGAATmTC AAGA 684 

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2024 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

CTGCAGGAAT TOGQCACAGC TQCGCTGGAG GCTTCATCTT TQCCGCCGCT GCCOTCGCCT 60 

TCCTGGGATT GGAGTCTCGA GCTTTCTTCG TrCGTTCGYC GGCGGGTTCG CGCCCTTCTC 120 

GCGCCTOSGG GCTC3CGAGGC TtSQGGAAGGG GTTOGAQGGG GCTCITCATC GCCQCGITEA 

AGTTGCGCTC GQGGCGQCCA TGTCGGCCGG CGAGGTOGAG CGCCTAGTCT CGGAGCTCAG 

CGGCGGGACC GGAQQGGATG AGGAGGAAGA GTQGCTCTAT GGCGATGAAA ATGAAGITCA 

AAGGCCAGAA GAAGAAAATG CCAGTGCTAA TCCTCCATCT GGAATTCAAG ATGAAACPOC 360 

TGAAAATGGT GTACCAAAAC CGAAAGTGAC TGAGACC5GAA GATtSATAGTC ATAGTGACAG 

CGATGATGAT GAAGATGATG TTCATGTCAC TATAGGAGAC ATTAAAACGG GAGCACCACA 

GTATGQGAGT TATGGTACAG CACCTGTAAA TCTTAACATC AAGACAGGGG GAAGACTTTA 540 

TGGAACTACA GGGACAAAAG TCAAAGGAGT AGACCTTGAT GCACCTGGAA GCATTAAIGG 600 

AGTTOCACTC TTAGADGTAG ATTTGGATTC TTTTGAAGAT AAACCAIGGC GTAAACCTOG 660 

TGCTQATCTT TCTGATTATT TEAATTATGG GTITTAATGAA GATACCTGGA AAGCTTACTC 720 

TGAAAAACAA AAGAGGATAC GAATGGGACT TGAAGITATA CCAGTAACCT CTACTACAAA 780 

TAAAATTACG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 840 

TACAAAAGCT GAGTITACTT CTCCTCCTTC TTTGTTCAAG ACTOGGCTTC CACCGAGCAG 900 

GAGATTACCT GGGGCAATTG ATSITATOGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 960 

CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAGGTCCTT TCTCAAAGAT CTCCTACTCA 1020 

AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACICA 1080 

CrrrCCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTQCTC CACCTCTCAT 1140 

TCCACCflCCG GGTTTTCrTC CTCCACCAGG OQCTCCACCT CCATCTCTTA TACCAACAAT 1200 

AGAAAGTC3GA COTTCCTCTG GTTATGATAG TCGTrCTQCA CGTCCATTTC CATATCGCAA 1260 

TGTTGCCrrr COCCATCTTC CTXSGTTCTGC TCCTTCGTGG CCTAGTCTTG TCGACACCAG 1320 

CAAGCAGTGG GACTATTKPG CCAGAAGAGA GAAAGACXX3A GATAGAGAGA GAGACAGAGA 1380 

CAGAGAGCGA GACOGTCATC QGGACAGAGA AAGAGAACGC ACCAGAGAGA GAGAGAQGGA 1440 

GaSTGATCAC AGTOCTACAC CAAGTCnTrT CAACAGC3GAT GAAGAAOGAT ACAGATACAG 1500 

GGAATATGCA GAAAGAGGTT ATCAGCGTCA CAGAGCAAGT OGAGAAAAAG AAGAACGACA 1560 

TAGAGAAAGA CGACACAGGG AGAAAGAGGA AACCAGACAT AAGTCTTCTC GAAGTAATAG 1620 

TAGAOGTCGC CATGAAAGTG AAGAAGGAGA TAGTCACAG6 AGACACAAAC ACAAAAAATC 1680 

TAAAAGAAGC AAAGAAGGAA AAGAAGCGGG CAOTGAGCCT QCCXX^KSAAC AGGAGAGCAC 1740 

CGAAGCTACA CXTOCAGAAT AGGCATGGTT TrGGCCTTTT GTCTATAITA GTACCAGAAG 1800 

TAGATACTAT AAATCTTCTT ATTTTTCrGG ATAATGITPA AGAAATTTAC CTTAAATCTT 1860 

GTTCTCTTTG TTACTATGAA AAGTTAACTT TTTTTCCAAA ATAAAAGACT GAATTTTTCA 1920 

TGTTAAGrrTA AAAATCTTTG TCTTGTACTA TTTCAAAAAT AAAAAGACAG CAATCACTTT 1980 

ATATOCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGGC GGCC 2024 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 931 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

CGCGCCMATA GCCGGAOGGG GATCTGAGCT GGCAGGATGA ATGTGGGGGT GGCACACAGC 60 

GAAGTAAACC CCAACACCCG AGTGATGAAT AGCCGAGGCA TCTGGCTQGC CTACAICATC 120 

ITOGTAQGAT TGCTGCATAT GGmCTACTC AGCATCCCCT TCTICAGCAT TCCTGTIGTC 180 

TC3GA0CCTGA CCAACGTCAT CCATAACCTG GCTACGTATG TCTTCCTTCA TACGGTCAAA 240 

GGGACACCCT TTGAGACTCC TGACCAAGGA AAGGCTOGGC TACTGACACA CTGGGAGCAA 300 

ATGGACTATG GGCTCCACTr TACCTCTTCC CGCAAGTTCC TCAGCATCTC TXXTATTCTC 360 

CTCTATCTCC TGGCCAGCTT CTATACCAAG TATGATQCTG CGCACTTCCT CATCAACACA 420 
GCXrrCATTGC TAAGTICTACT GCTGCCX3AAG TTGCCCCAGT TCCA3X3GQGT TCGTOTCTTr 
GGCATCAACA AATACTGAGG GATQQGTTTT GC3GACAQCTC CATQGGCA3X5 QGGAAGGCAC 
TGAAACAGAG GACTATAAAA COTCCTTCTC TTATTCTCCA TACTGTCTTC TACACCmA 
AAGCCTGAGA ACTATACAAC CTTTCCCAGA CTCCCAAGAA GEAGAAGAGAT TCGCAAATOG 
GGCTCCTOGG CCCAGTCCTG CTAGIGCSCAA GTTTCTTTGA ATCAGGAAGG CAGGTGAGGT 
AAGGGCCAAA TCACTCTCCT CCATAGCAGG AAGCCA3TPG GGCAGCTCCT TTCGTCMTA 
CATCTTTCCA TATCTPTTAC ACITACCACC TTOCAGCTCT CTirTGCTCT CTATnTICT 
TACAATAATT TTlTrCAGCT ATAGCTOCAG TTTAATCAGG ATCGGTAGAG AGCTOTCCTC 
ATAAGGCTGG GGGTGGGAAG ATGGAATACT G 

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:
OGGGGCOGGC GGGGTCTTCA GGGTACOQGG GTGGTTACAG CAGCTCTACC CCTCACGACG 
CAAACATGGC AQCGCAGAAG GACCAGCAGA AAGATGOCGA GGOGGAAGGG CTGAGCGGCA 
CGACCCTGCT GCCGAAGCTG ATTCCCTCCG GTCCAGGCCG QQAGTGGCTC GAGCGGCGCC 
GCGCGACCAT CCGGCCCTGG AGCACCTTCG TC5GACCAGCA QCGCTTCTCA CGGCCCCGCA 
ACCTQQGAGA GCTGrPGCCAG CGCCTCGTAC GCAACGTGGA GTACTACCAG AGCAACTAITC 
TGTTCCSTGTT CCTQGQCCTC ATCCTGTACT GTGTOGTGAC GTOCCCTATG TTOCTOGIGG 
CTCTQGCTGT CnTTTCGGC GOCTGTTACA TTCTCTATCT GCGCACCTTG GAGTCCAAGC 
TTGTGCTCTT TGGOCGAGAG GTGAGCCCAG CGCATCAGTA TCXTCTQGCT GGAGGCATCT 
CCTTOCCCTT CTTCrOGCTG GCTGGTGCGG GCTCGGCCGT CTTCTGOGrG CTOGGAGCCA 
CCCTGC3TGGT CATCGGCTCC CACGCTGCCT TCCACCAGAT TGAGGCTGTC GACGGGGAGG 
AQCTQCAGAT GGAACCCGTC TGAGGTGTCT TCTQGGACCT GCCGGCCTCC CGGQCCflGCT 
C3CCCCAOCCC TGCCCATGCC TGTOCTQCAC GGCTCTGCTG CTCGGGCCCA CAGCGCCGTC 
CCATCACAAG CCCGGGGAGG GATCCOGCCT TTGAAAATAA AGCTGnTATG GGTCTCATXC 
AGGAAAAAAA AAAAAAAAGG GGGGCCCCTC TAOGGGTCAA ACTTTA 
(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1238 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

CATGTAAAAG GATGAAATGT GACTTCTGGT GTTTmTAT TTCTATOGAG GGACTTTCTC 60 

15 GGGAOGGTTT CTGGCTCTCA GGCTCTGAGA AGCTGCAGTT TATGAGTGGC TCTCrTOlUlU 120 

CTQCCACCTA CTGGAGAAGC CATAAGCPGC AGCTTTAGGA AAflOSGAACC CQGGGCAGftG 180 

TGTGGGGAAG TCX3GATC3C5CA GCATCGCAGG GCTTTC3GAAA ATGAGAGGTC AGACTKTKTC 240 

CAGGAAGGGT GTAAQGAGAG GATGGATCCT GATACATGGA TTCAGGA'TCA TTAGGGTCCT 300 

GTCTGGC3ACA CTQQCCTTCC TGCTTACCTG CTCTTTCCTT CCTCCTTOCT OQGAGGAGQG 360 

25 QCTGGCTCAC TGCTCTGGCT TCATTTTCCA GAGCTGCCTG CTCCAGTCAC ACTTAGCTCA 420 

TCTTCTCTCA CTTTTCTCCT TTTGCCGATT AGTQGAOGTG ACAGAGATGT GAATGGGGCA 480 

GGGATGTCCT TT(3ATGGCAT CAAGACTTTA GCTTCTGGTG CGCTGTGTCC CAGCTCTCAT 540 

1TCAGTTGCA GOOGTGATCX3 AMAGITOGCA TGGAAGCTGA GACTCTCACT GACASTCAAA 600 

CCCTCAAATG AACACAATCC CTGCTTTCCT GCCAAGGATC CTTGTAGGCjr NCCCCCAGCT 660 

35 TCCCCAGTTT TTITCTGTGT CCTGACAAAG AAACACAGAG TAACTTC3AIT GCCCTCriGAC 720 

CTC3GCCAGTT GCATITOCCC TC5CAGGCTTG AGCCCAAGCC AGAGCCTTCA AAAGCTATTC 780 

AGCSrrTGTTGC CCAAAACACT GAAAAAAACT GCCCTGGCCC TGAflCCAAAT ACCTTQAACC 840 


CTCC3TAAACT CCATACOCTG ACCCCCTTGT TTTOGATATA CCCAGGTAGA ACAACTCTCT 900 

CTCACTGTCT GTTGrTGflGGA TACGCTGTAG OXywCTCATT AAGTACATTC TCCTAATAAA 960 

45 TOCTTTGGAC TGATCACCCT GOCAGTCTTT TGTCTTGGGC AATCTATACT TTTNCTCAGA 1020 

OGTTCCCAAG GCCTACTGAA GGGACTTAAC ATACTCTTAA TGC3CTTTCCT CTCTCTTOTT 1080 

TTACCTTATG CCCTCACTTC CTGAGTTAAC CTCCCAAATA CAGGATICAC CTCTACCCAA 1140 

CSCCCTTAGCT TCAAGAATAC AGGATCACCT CTACCCAAGC CCTTAGCTCA AGCTCTOCTT 1200 

TGGAAGAACC CAAACTAAGA CAGTOCTCCT QGTGCCCT 1238 




(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

AITOCCITCT GGTCCCTOGT GACACTGGQG TCATCCTTCA TCCCCX3GAGA GCATITCTOG 60 

CTQCTCCrCC TGACCCGGGG CCTGGTGQGG GTCQGQGAGG CCAGTTAITC CACCATCGCG 120 

CCCACTCTCA TTC3CCGACCT CTTICTQGCC GACCAGOGCG ACCGGATGCT CAGCATCTTC 180 



tacttpgcx:a TrcasGrroGG cagtqgtctg ggctacattg caggctccaa agtcaaggat 240 

ATGGCTGGAG ACTGCXACTG GGCTCTGAGG GTGACACCGG GTrCTAGGAGT GGTOGCOCOT 300 

CTCCTCCTGrr TCCTOGTCACT GOGGGAGCCX; CCAAGGGGAG CXXSTCGAGCXS CCACTCftGAT 360 

20 TTGCCACCCC TGAACCCCAC CTCarGGTGG GCAGATCTGA GGGCTCTGGC AAGAAATCCT 420 

AGriTTCGrrCC TGTCTTCCCT GGGCTTCACT GCTOTGCSCCT TTGTCACGGG CrcCCTCSGCr 480 

CECTOGQCTC aSGCATTCCT GCTGCX3TTCC CGCGTGGTCC TTOGGGAGAC CCCACCCTOC 540 

CTTCCCGGAG ACTCCTGCTC TTCCTCTGAC AGTCTCATCT TTGGACTCAT CACCTGCCTG 600 

ACCGGftGTPCC TGGGTGTQGG CCTGGGTCTG GAGATCAGCC GCCXSGCTCCG CCACTCCAAC 660 

CXXXX3C5GCTG ATCCCCTCGT CTGTGCCACT GGCCTCCTC3G GCTCTCCACC CTTCCTCTTC 720

CTCTCCCTTG CCTQCGCCCG TGGTAGCATC GTGGCCACTT ATATTTTCAT CTTCATTCGA 780



GAGACCCTCC TGTCCATGAA CTOGGCCATC OrGGCCGACA TTCTOCTGTA CGrrOGTCATC 840 

CCTACCOGAC GCTCCACt3GC CX3AGGCCTTC CAGATCX3TGC TGTCCCACCT GCTOGC3TCAT 900 

GCTGGGAGCC CCTACCTCAT TGGCCTGATC TCTCACOGCC TOCX3CXX3GAA CTGGCCXCCC 960 

40 TCCTTCTTGT COGAGTTCOG GGCTCTGCAG TTCTCGCTCA TGCTCTGCGC GrTTCTTOGG 1020 

GCACTOGGCG GGGCACTTCC TGGQCACCGC CATCTTCATT GAQGCCGACC GCCGGCGGGC 1080 

ACAGCTGCAC GTCXAGGGCC TGCTGCACGA AGCAGGGTCC ACAGACGACC GGATTOTOCT 1140 


GCCCX3VGCGG GGCCGCTCCA COCGCGTGCC CGTGGCCAGT GrGCTCATCT GGAGAGGCTC 1200 

CCGCTCACCT ACCTGCACAT CTGCCACAGC TGGCCCTGGG CCCACCCCAC GAAGGGCCTC 1260 

50 GCXXZTAAACC CCTTGGCCTG GCCCAGCTTC CAGAQGGACC CTGGGCCGTC TCCCAGCTCC 1320 

CAGACACTAC ATGGGTAGCT CAGGGGAGGA GGTGGGQGTC CAGGAGGGGG ATCCCTCTCC 1380 

AACAGGGGCA GCCCCAAQGG CTCGGTGCTA TTTGTAACGG GATTAAAATT TCTAGCCAGA 1440 




AAAAAAAAAA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1395 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

CAGGTOCAAA GTGOGAAGTG TCW3TCCTCA GTCTTOGC3CT ATTCGGCCAC GTOCCTOCCG 60 

GACATCGC5AC GCTQQAGGGT CAGCAGCX3TG GAGTCCTOGC CTlTiXAXrii: CACGGCTt3GG 120 

AAATTGGCCA TTGCCAOSGC GGGAACTOGG ACTCAGGCTG CCCCCCGGCC CTPITCTCATC 180 

CGrrcCACCGG AYT0C3TGGGC GCITOCACTG GCGCTGA3CT ACTTTCCTCA CCTCTCACCC 240 

GTAITCTCTC CAGATTAAAG GTACGACATT TOGAGGCCCC AGCGAGAAAC GTCACCGGGA 300 

GAAAOSTCAC CGGQCGAGAG OSGKCCCOCT CSTCTCXTTCCC CCGGAAGGAC AGCCAGCITC 360 

TAGGQGGGAG 1X3CCACCTGA AAAAAAAATT TCCAGGTCCC CAAACXSGTCA CCGTCPKXG 420 

25 GAGACAQCGG ATCGACTACC ATOTC5GC3TC3C CCACAAAAAT TYCACCTYTC AC?rCCTCAAC 480 

TGCrTGACCCC GGCX3TCAGTT CCAGAGAGAA CXSACTCCCTC CTCCTTOGAA GAGACCTCAC 540 

ACCGTCATCA CGATC3CCAAC GGCTCTC5AAG C3TCGATC5GCA TTCCTOCCTG GATTCA3CAC 600 

TCCCC3CATCA AAAAGGOCAA CRGAGCOCAA CTAGAAACAT GGGTCCCCAG CGCrOGOTCA 660 

CaOCCCCTTAA AACPGCACCT AAGTTQGC3TG AAGCCATTAG ATTAATTCTT TrPCTTAArr 720 

35 TTCTAAAACA ATGCATAGCT TCTGTCAACT TATOTAICTT AAGACTCAAT ATAACCCCCT 780 

'TCTrATAACT GAGGGAATCA ATOATTTGAT TCCCCAAAAA CACAAGTOGG GAATOrAGlG 840 
^ TCCAACCTQG TTTTTACTAA CCXntSTTTTT AGACTYTCCC TTTCCTTTAA TCACTCAGCX: 
TO3TTTCCAC CTGAATTGAC TCTCCCTTAG CTAAGAGCGC CAGAIXSGACr CCATCITOGC 
•ICTTTCNACT GQCAGCOGCT TCCIYCAAGG ACTCAACTTG TCCAAGCTCA CTCCCAGCAC 
45 ATCCAAGAAT GCAATTAACT GATAAGATAC TGTTCGCAAGC TATATCCGCA OTPCCCAGGA 

ATTCGTCCAA TTGATTACAC CCMAAAGCCC CGCXJTCTATC ACCTICTAAT AATCTTAAAG 1140 

CCCCTGCACC TGGAACTATT AACGTTCCTG TAACCATTTA TCCTTTTAAC TTTTrTOCCT 1200 

ACTTTAOTrC TOTAAAATTG inTAACTAG ACCCCCCCTC TCCTITCTAA ACCAAAOTAT 1260 

AAAAGCAAAT CTAQCCCCTr CTTCAGC3CCG AGAGAATTTC GAGCGTTAGC CGTCTCTIGG 1320 

55 CXZACCAGCTA AATAAAOQGA TrCTTCATGT GTAAAAAAAA AAAAAAAAAA CICGGAGC3GG 1380 
GQGCOCGGTA CXXAA 

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1186 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

C3C3CACGAGCC GJGCAAGOCGA GCTAGGGTGA AAACTGCSGGG CGCACCAGGA OGTONGACAG 60

AAAAGCAGAA C3ATt3AGACTC TGTTCATPCA CTTITCCTAG GCXX3VTCCTC TOCTCAlCTr 120

TOCCCCTCCC ATCATAOCTC CTOCITOCTG GAG0CTCTC3C CGGCTTOSCT CTAATOG?K3G 180

CACTEACCTG GATATITCAG TQCXAGGATG AAAGGCGAGA CTCACCCTAC GCGC3TCGGAC 240

AGATGGGGAG AGGAAAAAGG CAGAGATQC^ CAGGAGAGGG GTO^ 300

GTTQGGrrCAG GGGAAAAOGG TGGGGAGAAA GAGGGGTQCA GGCCCTOCAG GCCGCTTAGC 360

CAGCAGCPGC GGCCTCCCCG GGCCCTTC3GC ATCCAACITC GCAGACAGC3G TACCAGCCTC 420

CTOGTC3TC5TA TCATAGGATT TCSTTCACATA GTIOTPATCCA TCATCTTCGr AAGOTTAAGA 480

AGCCCTGC5TG GTGCACCATG ACATCCAACC OGTATATATA AAGATAAATA TATATATATA 540 

30 TGrrATGTAAA TTATGGCACG AGAAATTATA GCACPGAGGG CCCTOCTCCC CTOCTOGACC 600 

AAGCAAAACT AAGCCTOTG GITTGGGTAT TATCTITCCT TTICTrATIT G TriUrmT 660 

GTGGCITCrC TTATGrOGTO ATAGCACAAG TGCCAGTCGG ATO3CTCTCT AITACAGAAT 720 

AGWirrriA ATTCATCAAT GrPCTAaiTA ATGrPCTACCT CAGCACCTCC TCTTAGCOTA 780 

ATTTEAGGAG GTTGCCCAAT mUTrivrr CAATTTTACT GGrTTACTnT 1TCEACAAAT 840 

40 CAATCTCTTT CTCTCTTTCT CTCCPCCCCA CCTCTCACCC TIGCCCTCTC CATCTCCCTC 900 

TCCCGCCCTC CCCTCCTCCC TCTGGCTCCC CGTCTCATTT CTCTCCACTC CATTCTCICT 960 

CCCTCTCTCC TGCCTCCTGC TGCOCCCTCC OCAGCCCACT OCCCOGAOTr GTCCTTCCOG 1020 

CTCCITATCT GTTCTAGTTC CGAAGCAC7IT TCACTCGAAG TTOTOCACTC CTGGTTGCAG 1080 

CTPTOCGCAT CTCCOTTCCT ITOGTGTAGA TTGACGOCyiT TCTITCTAAT TrcAGTOnT 1140 

50 CTGACAAGAT TTAAAAAAAA AAAAAGGAAA AAAAAAAAAA AAAAAA 1186 



(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

AAAACATGCT ITCflGCSGCGT CXrCTATGTA TTCGC3GGGGC CXMQGACAC TCAGGCTOGA 60 

KATCXCTCCT CACTGCGCTC AAGAaXSOCCT CAGCAGACAC CAGTTACCCA GCTCAAAGTC 120 

ACAATCCCTC CCAGAAGTCT CCCAACACTA GTCCTGACCA GAGGTOGGGC TCTCAQGCTA 180 

GGACHTPCAC ACACAATGAC AGGCTCSCTGG GGGACATTGC AGGACCCCTT TTCCTYTCCT 240 

CrcCATGCTA GAAGCCAGCC CTAOGMAGCT GCAGTTACTC CCTGTGACTC AGCAGCAGGC 300 

TGATTCAACA CAGCTGCCCA CACAAAGCCA GTGGrTAATAC ATCTOTTTAC CTITCCCTAT 360 

CACCCAGACA CAAGCCCCTT TCCCAGGTCA AACXACAGGC CGATGCATCT CCAGITTGAC 420 

AGTCAAATCA CTACTTCCAT TGCTACTTTA GATCAGCCAA AGrrGGTGACT GCTOCAGTCJr 480 

GTGGCTATCC CTACAAGGCC CACCCAAGGG ATGCCCAAAG CCCAACCTTC TCCAGGGCTC 540 

CAGCACaJAGC AACCCCACCA GOCTAAGTCC AGCAGAGGAC CTCCCACCCA ATGTCTrorr 600 

CTAA3TAGAA GQQGAAGTrA GCCACAGAAA ATCAACTTAT CTATAATTAC AAAATTCTCT 660 

TGACTCACCT TAAAGTTCCT ATTGACATCT ACTGCTTTTA AACCTATTTG AAAACTCTCA 720 

TACTAAAACA AATGACACTC TAAGAAAGrTT TQGGAGCCCC ATGCTGAGAA CCATTTCTOT 780 

GCAGTCAGGA TGTTTCCAGA AGCTACTTAC CTACATGTGA ATGPGCCATT TTCWiXJCIT 840 

TTGTAGAGAA AATCCCXTTTT ACTTTTTQGA ACAGTAATGG CAGCTTCTAG TACAGCCATT 900 

ACAGTTTCAT ATGAGAAAAA TTAAGAATAA CTATAAAATT GrTTAAAATAT CCAATAATOG 960 

ATAATGATGG CCAGAAGATT TAACATACAA AGTAATTCTC AATGTAAAGC TATTCAGCTC 1020 

TTCCAGGrrTG AATGCCCTCT AACCX:ACCCT GACCTTCCAC ATCATCTTCA AAAAGCAGTT 1080 

TCTCTOTTCC CCATGATTCT CCTATAAGGT AACTCTTTAG TCClXXATrT AGCACATTTT 1140 

AAATCCTCCA AAGAATAAGT ATCATGT6AT TATTTTAOCT TTACAAAAAA AAAGTTGAAT 1200 

GGCGTTTTAT TITCATQGCX: TATAAGCAGG TACCTTAGTA GGGCAGATAT AGGAAAAACA 1260 

AA3TAGAGCA AAACAAATCC TCTACAAATC CAAGGCAGQA AAAGTGGTGG CAGAGTCACT 1320 

CAITCTCCTG TCCCTCCCAT CAGGTCAAAT CAGGAGGCTG CAGTGAAIGC ClVl'Xr x rm 1380 

AATCTOTAGC AGTTGTTCCT GTAACTCTTT AAAACTTGGC TATAGGCTGT TTAGCACACT 1440 

ACAGATTAAA GATACAGTTA CGTAAACAGC AAAGTAATTT TATAGTGCTT CATCCATITA 1500 

TCATGCTTTG GTTTCCTAAT TTTTPCACAT ACCTTTTTCT ATCACAGTCT GTIGCmTC 1560 

TACACATTTC TCAIATTGGG GITCGACAGG TAAACACAAA CTGCTATTTC AGTAGAAAAA 1620 

GTTATTGTTA TQGAATATTA AACCCAATAA ATTCTATAAA GGGTAAAAAA AAAAAAAAAA 1680 




AAAAAAAAAA AAAAAAAAAA AAAAAAATTC CTG0t5GQCCG CANOCTTTTr CCCTITOGCST 1740 
GKXXXfirta TITOOGCTTG GGCACTGGGC (XTKWm- TACAACGTCG TCANC3GC3C3QG 1800 
AftCCOXSGGG QC5C3TTTCCCC C 1821



(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

TGCCCTTTTT CCCACCGATT CGGGGCNTOG TSAAC3GTt3GG AGATOTCAAC TCCAATTAAG 60

GGACTGGAGA GAGGTGAAGA ATTITCCAGG TCSGGAGATTT C3GATITCAAT GTOGACTICT 120

AAATC3ACTTG ACCITCCCAT CTGTGTTCAA CXTPCACCCTT TXXTCK3GGG TrCCTOCSCSAG 180

AGCTTACTCA CCCCGGAGTC TTrTCTTTCT CTTGCTCCAA GAAGAGCCCT CTTOGTOCTT 240

TACCACCGCT TOGAGTCTCC OGAGGACACA AACAGGCAGA GAGGGACCnG TAGQGAGACT 300

TdTTCcror irrcreTGCT ttccttttta caggactccc ggaaggccac tcatogccat 360

GCCAGGAGCr TTCTCAGAAA CAGTCATAAA CGATCTCTTC ACHCTCTTTC ITCTCCTCCC 420
AGCT6AGCTT TCTEATTCCA CCCTrTCTGG TGTCTATAGG AArTOCATCAG AAGACCCTOG 480

GACGmrrc TQCTCTCTTC TGGCCCTCCA TQGAGCCATG GGCCTCGGCC TCGGCGGCTC 540 

CTCAOCCTCA CAATPTATTT CCTCCTCCCG TGCCAGCCCT a t,TmXJ i V r CTCAAACCGG 600 

40 irrTAAAATG TGACTCTCCC AGAGAAGAAG CCGCPQGCTC TATCAAACTT GACGGCGCTT 660 

TTGTAAGGTG CCACCCCCAA ACTTTAAGGT AGCTAAACCA ATTTTTAAAA GATTCAATOG 720 

CTICTPCATC CTCCAGATGT AGCTATTGAT GTACACTTCG CAACGGACTG TCTCAAATIG 780 

TQGTGCSrcCT GATTTATAGG ATTTCATAAT TAAAATGTCT GCTGAATAAA AAAAAAAAAA 840 
AAAAACTC6A GGGGGGCCCG GT 






(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 696 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
CTQAGGOGAG TGAAGTOGAC TCPGAGGCXn' ACCGCTACCG CCACTGCTGC GGCfiQGQQC3G 
TX3GAGQGCAG AQGOCCGCGG AGGCXX3CAGT TGCAAACATG GCTCAGAGCA GAGACGGCGG 
AAACCCXTTPC GCCGAGOCCA GOGAGCTTGA CAACCCCTTT CAGGACCCAG CTGTCATCCA 
GCACXX3ACCC AGCCGGCAGT ATGOCACGCT TGAOGTCTAC AACCCTTTTG AC3ACCCGC5GA 
GCCACCACCA GCCTATGAGC CTCX:aGOCCC TGCCCCA3TC CTTCCAOCCr CAGCTCXTTC 
CTTTGCAGCCC TCGAGAAAGC TCAGCCCCAC AGAAOrTAAG AACTATGQCT CATACAGCAC 
TCflGGCCTCA GCTGCAGCAG CXACAGCTGA GCTQCTCAAG AAACAGGAGG AGCTCAACCX; 
GAAGQCAGAG GAGTTOGACC GAAGGAGOGA GAGCTGCAGC ATC3CTGCCCT GGGHGGCACA 
GCTACPCGAC AGAACAATTG GCXXXCTCTA CXOTCTTTTT GTCCAGTTCA GCCCTOCTTT 
TTCCAGGACA TCTCXATOGA GATCCCCCAA GAATTTCAGA AGACTCTATC CACXZATOTAC 
TACCTCTQGA TCTGCAQCAC CffrocajTCTT CTCCTGAAYT TOfTCGSCTG CCrHQGCCAGr 
TCTGTGTQGA AACCAACAAT GGOGAGGCTT TQGGTT 

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1886 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:
CAGGCCACTG AOGCTTCTTT GOGAGGGATG CAGGAGGTCC TACAGAGAAA GGCGCTTCTT 
QCATKTCAGA GGtXCCACAG CCTGTCACCC ACAGATCACC AAGCAGCTTT CTACCTOGCT 
CTGCAGCTTG CCATCTCXaG ACAGATCCCA GAGGCTCTGG QGTATGTCOG CCAAGCTCTT 
CAGCTTCAAG GTGACGATGC CAACTCCCTG CACCTCCTTG CCCTCCTQCr GTCAGCACAG 
AAGCATTACC ATGACGCTCT GAACATCATC GACATQQCCC TGAGTGAATA CCCAGAAAAT 
TTCATACTAC TGTTTTCCAA AGTGAAGTTG CAGTCACTCT GCOGAGGCCC GGACGARGCA 
CTGCTGACTT GTAAGCACAT GCTGCAGATA TGGAAATCCT GCTACAACCT CACCAACCCC 
AGTGATTCTG GACGTGGGAG CAGCCTCTTA GATAGAACCA TTGCTGACAG ACGACAGCTT 
AATACAATTA CTTTGCCAGA CTTCAGCGAT CCCGAGACAG GCTCCGTCCA TCCCACAra5 
GTAGCAGCCT CAAGAGTGGA GCAQGCACTG TCGGAAGTGG CTTOGTCTCT GCAGAGCATG 
CCCCTAAGCA GGGCCCGCTG CACOCCTOGA TGACGCTC3GC ACAGATCTOG CTCCATCCAG 
CTGAflGTCTA TAT00C5CA1X: GCSGAAGCCTG CAGAAGOCAC AGCCICTACC CAAGAAGCTO 720
^ CCAAOCTCTT COCAATGTCC CACAATGTCC TCTACATOCG CXSGCCAGATT GCTCAGCTCC 780 
QC3C3GAAGCAT GGACXSAGGCG CGGCGCnOTr ATGAAGAGGC CTTAGCCAWT CAGCCCCACC 840 
CACGrrSAAGA GCATGCAGCG ACITOGCCCT GATCCTTCAC CAGVTAGGCC GYTACACTYT 
10 GGCX3GAGAAG A3CCTCCGGG AaSOQGTOCA OGTGAACTCG ACAGCCCACG AGGTCTOGAA 
OQOGCTGQQC GAGGTCCTCC AAGCTCAGGG CAAOGATCCG GCGQCTACGG AGTOCITCCT 
GACAGCCTTC QAGCTOGAGG CCAGCAGCCC OSCCGTQCCC TTCACCATCA TCCCCCQCCT 1080 
GCPCTGAGCA GGCGCCTGCX AGCCTCACCT GCXX3CTCAGC CTOCAGAGGC CCTOCCGQGC 1140 
AOCAGGGCTT GTGCCATCGC CCXAAGQGGA TGAA3CTGCC GCACTGAGGC CAGGGACGAG 
20 TGTTCAGTGG GCCACAGTPGA ACX:AACCAAA CCAACCCCGA AOCATCGCTC TCGCCAroTG 

OCJmCTCTT GTrmTTTG CCAGCCCAAT GGTAGTTTCT QAACCTATTC ACATTOTTCA 1320 
^ AAATOGATCA 1CTGCCATAT TOTCITAGTT GACATCTGAG TriTCAOTAA AATCATEATC 1380 
GAATTAATCA GCAAATOPAG AAGAATATAT TCAAAGITAA AATTCACTGG CAGCACAGAT 1440 

TATrrrrATc agagctctaa agaaaacaac TGfrccrriTC tccccacxw: cxxrrccrccc isoo 

30 CX^^CTTTOGC CCAGAAACCA AATCTGAACT TOTCTCTCC CACCTCAGCA CTAGTCCATC 1560 
CCAGGACACC AGCTGACAAT TTCTTGCnTT TACTGTrCAAT AATICTACCA TtTTCA'TCAAT 1620 
TACTOTCCTC ACTTAGAACA AAGCCTGACT CCGAGAATAT TTATATnTA CCAATATATC 1680 
CCTGnTACAA GAGAAGGAAA TATGAGTTAT TTAAGTITAA CTTITTTATG TCAA3TCAGA 1740 
GTTTATTTAT CGAGGGAAAT ATGTEACAAAG AAGCTTCAAA TOGAATATIT ACCGACAJTTC 1800

CTTATACATG ACAGACAdT GGCTACATG6 GAAGA3tam; TTAATAATAA AATCAmTT 1860
AAATQGAAAA AAAAAAAAAA AAAAAN 1886



(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1774 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:
CTCAGCTACC GTATACAGTA GGACATAACC CCATTTCACA TGCACTACAC TGAGACITOC 60 
^ CTCCTCTCCC CCCACATTGA AGATOITCTT TTTTCATAAC TATATACTAT TCCAITOCAT 120 
GAAXAITCTG TAATrrATTT AATCCCCTAT QGATTGATAA TTAGCTTCAT TATAGATflGA 180

AGTtJTAATTA ACATTCCTGT ACATCTAITr TOCXACTTOT (JTOGCTATIT CICTAGGAIG 240 

AATAACTAGA AATrTATIGG AaxaDGTlTC ACATTTOCAG TnTCAAAAC TACTACCAAA 300 

A«3KnTCAC CAATTTACAA CTCCATCATT AGTAAGAATC CCTCTITCCC lATAGTCroC 360 

CAAOOCTCAA TCCTTAAAAA TmTGOCAA TCTOGTBGGC AAAAmriT I LT rnvm - 420 

GAATATTAAT GflGGAGGAAC ATCTPITCAT GTTrCTTQGC CATITCCAIT TCCTAITATC 480 

AATTOCTTTT GOCCATTTTC CTTrrTTTAA TTATGAAAGT CTAATCACTA CCTTCTCaTT 540 

GTATAAAAAA CACflGTTCTT TGAATAGAGA GACCCTTITC TCCAATOCTA CCAATCACAT " 600 

TOCACTTACC ACAGTITAAC ATACATCCTC TAGTCAOCTT TCCCTAOSAA TATACftTACA 660 

CATAAAAACA CmTTACAT AAATAGGATC TCATATTCTG TA(3CTITPrA AAATPTTOCT 720 

CTCAAAAAAA GATAACAGC3T CTTraAATrT CTITAATOGT TCAWATCAT TAAATACTAT 780 

GAAAATQCCA TTMTTATPC CTTTAArrTT TITCCTCTCG CTATTACATT CXX^AACTAA 840 

ACATOCTAIT CflGATGTCTT TOPtSCATGTC TGTGAATRTT TCITTACTCT GGflGTCCAGT 900 

AflGGTCGATT TnOGtOCUi AOQGTTTOIT CTCTCTCCAC CTICAGICIT CCCAAftGGCC 960 

TTCATAACTG TATTTTCACC AAGTOTATGG AGAATCTTCA 'nTCCCCATA TAACCATACC 1020 

TACACTTGAT AGrTTTTATC TGriTQQGCGA AAAftGAACCT -nTCrrATIT TOCATTTCCC 1080 

TGATTATAAA AAAAAATQGT GflGATTQGGG TTATmCAT amXTTCCC CATTTATAGT 1140 

rrACTCTQGA TTGrrroTAT OOCTTAOCTG CTTTCTATTG GGnTATOTOT GGATATATTO 1200 

•mrrATiTC ncAocATCT cxmtccccat citctgctaa cACAAccnr AinaTnOT i260 

OOGGAflDCTA TrCCCTGrTOG CTTASGTCSG CATGTGACCA GGCCTOGCCT CCTCfttTTCCC 1320 

ACftGCTTCCT AGCCACKJTG ATAAAAGAAT QGGTATATAA CTTAAGCCflG GCTAAGGAAA 1380 

OCOCTTAACA GAACTICIGC TOGAACTACT GGAAAGAAGG CmATCGfiG ATCCCMBAA 1440 

CCAAGGflCCA TCTAftGCXTO AATTTOTOCC ATOTOGAGAG ACrTCTOICTC AGGAGAAACT 1500 

OGGATOCTAG CAGAAATOGA ASGAGAACTA AGTTCTGATC TCATnTICT GGAQGCCCTA 1560 

GAICCAGCTG TGCCTAAAGC CIGCXCTACT CXXKACTTTA AAGTmCTG ASCCAATAAA 1620 

GTCCCrmCT TGPITAAGAT AATTGAATrG AGTTTCTGTT CTGATTAATA TAGCTTATTr. * 1680 

GTATTITCTT ATTGArrroT AGAAAACCTT TCTAATTTTA AATTCTA(3AC TITATGCaCT 1740 

ATATAAGTTA ATAAAATTAG CATOGCCTTC C3VTG 1774

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2503 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

GOCAOGAGCG AAGGCAAGGG GCXACCAGCT CAGGACTC3CA TCTOCCTOCC ATTTCCCTrc 60 

CACTCCTCCT rrCTCSGAGTC TGACATTAGA AAGCCAGOGA GAAOGAAGAT TCAAACAAOC 120 

AACCCTGATT TOCTCSCTTCT CCTTTTCATG ACHCTTOCTC TOC3TCTCTOC ACCTCCTITC 180 

TC7PCCCCCGG CAGAGC3C3CAG TAGAGATQGC OCSGCCCAAGG CCTCRffTOGC GCGACCAGCT 240 

CSCTGTTCATG AGCATCATAG TOCTCGTGAT TGTOC5TCATC TGCCTCATtST TATACC3CTCT 300 

TCTCTGC3GAG CXTOGCAACC TCACTGACCT GCCCAAOCTC AGAA3X:GGC7r TCTATAACTT 360 

CTGCCICTCXt AATGAGGACA CCAGCACCCT ACAGTCTCAC CAGTrccCTC AGCTOGAAGC 420 

CCTGQGC3C?rG CCTCXSGGriTC GCCITOCXCT QGCCAGGCTT GGCCTCTACG GGTOCCTOtTr 480 

CCTCACCCTC TTTGCCCCCC AGCCTCTCCT CCTAGCCCAG TCCAACAKTC A3X3AGAGAGC 540 

GTGGCGSCTG GCAGTCGGCT TCCTGCSCTGT KTCCTCTC5TG CTCCTCGCAG GCGGCCTOGG 600 

CCTCTTCCTC TOCTATCTGT GGAATGQGTC ARC5CTCTCCC TCCCGGCXSCC TCGCnTTCTA 660 

GCTCTGC3QCA GCGCCCAGSC CTTACTCATC CTCTTQCTTA TAGCCATCQC TCIOTTCCCT 720 

CTGAGQGCTG AGAGQGCTGA GAGCAAGCTT GAGAGCTCCT AAAGGCTTAC CJTCATTOCAA 780 

GCSGTTCAGTT CCAACCATGG TCAGAGGriGG CACATCTC3CT CAGCCATCTC ATTITACAGC 840 

TAACGCTC3AT CTCCAGCTCC AGCGATC3GAA OCCACTACAG AGGAGt5TO0G GCCCCTOlXJr 900 

CAAAGAGGCC GAGGQGCAGC AAGGGCAC31C AGGGCACCTG TGACnCTTA GTACAAGATT 960 

GTCTGrrCCrr CAGGACTTCC AAGQCTCCCA AAGACTCCCT AAACCAJTOCA CSCTCATKTrc 1020 

ACACCAATTC CTGCTITAAT TAATGGATCT GAGCAAATCT TCCTCTAGCT TCAGGAQGCT 1080 

GGGGAGGC5AG TGATTGCTGT CATGGGGCCA GACTTCCAGG CTGATTTGCC AAATCCCAAA 1140 

ATGAAAOCTA GCAAAGAACT TAOGGCAACA AACGAGGACA TTAAAAGAGC GAGCACCTCA 1200 

(nxmyrCVGG G6ACATGGTT AAGGAGCTTC CACTCAGCCC ACCATAffTCA GraOGCCGCC 1260 

ATAAGCCATC ACTQGAACTC CAAOCCCAGA GGTCCAGGAG TCATCTCTGA GTCACTCAAC 1320 

AAAGACAGGA CACATGGGGT ACAAAGACAA GGCTTGACTG CTTCAAAGCT TCCCTGGACC 1380 

TGAAGCCAGA CAGGGCAGAG GCGTCCGCTG ACAAATCACT CCCATCATCA GACCCTCGAG 1440 

GACTCCAAAT CCTOGCTGTG AACAGGACTG GACGGTTGCG CACAAACAAA CGCTCCCACC 1500 

CTCCACTTCC CAAOCCAGAA CTTGGAAAGA CATTAGCACA ACTTACGCAT TCGQGAAOTC 1560 
TGTGTATTrr CTAQCACTTG TGTATO3GAA AACCTOTATG GCAGroATTT ATTCATATOT 1620

^ TOCTGTCCAA AC3CCACACTS AAAACftGAfiG CAGAGACATG TACTCTOGIG TCATCTCTTG 1680 

TCrTCAGTGT CTCTTCTQQG CTOCTGTCCC TCTTGCITTA TOGCTAGCTC CXXS3CSGGACC 1740 

MGGTACAQG TCAAAOCAAG GTOGCABCTT GOtSGGAGGAG QCCTOTCTOG CTTACCAGTC 1800 

10 TATACACTGT GQCCTCAACC TCCCaGflCAG GQCaCflGAAC TOreOGOWSC TCGTTTOCTr 1860 

TCTAQQCrOG CTOGAGflGGT GGGAQCTCAT TGMSGACTC ATGJOGGAAA CTAimTCA 1920 

AfiCAGGCTTC CTCCTTCAGG AGRGATCATG CXSGflCTARAC TOTAQCaATr CX3tf3IWacC. 1980 

TQGCAGTGAT CCTmCTIT GCAAflGTRCT GfrcTCTITOG TTCCAGEAfiG ITOGACXaCC 2040 

ACMGACATy ATTITCCCTG GAM3CT0GTC ACTGACTAAC ACAGACAATT QQGACTCCAG 2100 

20 AGCCTCAAGA GCCAGGAQAG GOCACAGTAC ATACAGAGGG ACTCAAAIGG GATCTCATTT 2160 

TGAGTCCTGC CnCCGCACA CTCAGAflCGG CAMXCCAAG GCCCGGAGTO ■TCCAGGGCTT 2220 

CTQGCCTGAG GTGAATCTGC CAGQCCCAAG AAQGCACAAA GCSTAGGAGCA CSGAGAGCCX: 2280 

CAXTCccacA GQCGcaaasGc ccaGCAocac cagtogaagc tcagctcicc TCcaGCTCcr 2340 

CTCGQCAGAC AGnTCACTGC ACACTTTATG CCCTAGCTGA AAAftGATCTC CCX3GACCTAT 2400 

30 TTCAGCACaT CCTCTTOCTC CTCCTCCTCA GQGCTCCTCC TACaCGCAGA GCTOGAACCC 2460 

CCCGOCCICrr GQGAAGGGCT GAQGCXTTOGA GYCSGTCCCT CTC 2503






(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2801 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

CTGGAAAGCC GAGQGTAGCC GAGCGGQGCG GGCGCTCTQG AGCX3GCGGGT GCTCQGGCK3 60 

COJPOCGCTC CGCCAGAAGC A£XX5AGCAGC CGAQCOGGQG CXXXSCCGCXC TCCTCCTCCA 120 

TGAOSCCCGA CJTGAGQCGCG GOGGCTATAG OCXaCO0GCX3 GCGCCTTCCE CCXGO(?ICCT 180 

ATOKQAGCG CAOSACMAGC GGCCCCTGQA OGAGGAGGCG GAOGAGGAGG AGCATOTCQG 240 

55 ACGGTITCGA TOQQGCCCCA GGTCCTQCTC GGGGCOQGAR CCOGQGOCTC QGCCGOQGAG 300 

OSGQCGGGCC ■hlAGQGCGGC GGTTTyoCGA AMaGAROSGR GCCTGCTCAG CGGRCGOGGC 360 

^ ACCAGCCGOC GCAACXXrAAA GCCCX3GGGCT TYCTGCARCC AICGOCQCTC OGCCAHCCCA 420 
GGAOSACCCC GCCGCCAGQG GCCCAGTC30G AGC5TCCCCGC CAGCCCCCAG CGGCCTTCCC 480

GGCCCX3QGC3C GCTOCCAGAG CAAACGAGGC CCCTQAGftGC TCCACCTAGT TCACAGGATA 540 

AAATOCCACA CXAGAACTCG GAGTCAGCAA TOGCTAAGCC CCAGGTOCTT CTAGCTCCTG 600 

TATTAATtSTC TAAGCTGrTCT GTCAATCCCC CTCSAATTrTA CCCTTCAGCyr TATTCTTCCA 660 

GTTACACAGA ATCCTATGAG GATGGnTCTG AGGATTATCC TACTCTATCA GAATATCTTC 720 

AGGATTTTTT GAATCATCTT ACAGAGCAGC CTCSGCAGrTTT 'TCAAACTCAA ATO3AACACT 780 

ITOCAGAGAC CXTCAAICGT TGTGrTTACAA CAGATGATGC TTTOCAAGAA CTTOIXSGAAC 840 

TCAPCTATCA ACAGGCCACA TCTATCCCAA ATTTCTCrTA TATOGGAGCT CGCCTOICTA 900 

ATTACCTOTC CXMCATCTG ACAMTAGCC CACAGAGTOG CAACITCCGC CAATTCCTAC 960 

TTCAAAGATC T0C3GACTGAA TATGAAGTTA AAGATCAflGC 'TCCAAAAGGG GATCAACHTA 1020 

CTCGAAAACG ATTTCATCCA TTTGrTACTCT TTCTQGGAGA ACTTTAICTT AACCK3GAGA 1080 

TCAAGGGAAC AAATGGACAG GTTACAAGAG CAGATATTCT TCAGGTTOGT CTTCGAGAAT 1140 

TGCTGAATGC CCTOmTCT AATCCTATC3G ATGACAATTT AATrTGTOCA CTAAAATTCrr 1200 

TAAAGTITCAC AGGATCAGTT TTGGAAGATG CTTGGAAC3GA AAAAC3GAAAG ATGGATATGG 1260 

AAGAAATTAT TCAGAGAATT GEAAAACGTTC TCCTAGATGC AAACTOCACT AGASATCTAA 1320

AACAGATGCT CTPGAAGCTT GTAGAACTCC GCTTCAACTTAA CTOGCSGCAGA GrTCCATOCAA 1380 

CTTCAACATA TAGAGAAGCA ACACCAGAAA ATGftTCCTAA CTACITTATG AATCAACCAA 1440 

CAnTTATAC ATCTGATQGT (nTCCTTTCA CTOCAGCTGA TCCAGATTAC CAAGAGAAAT 1500 

ACCAAGAATT ACTTCAAAGA GAGGAdTIT TTCCAGAITA TGAAGAAAAT GGAACAGATT 1560 

TATC0C5QGGC TGGTCATCCA TACTTGGATG ATATTGATGA TGAGATOGAC CCAGAGATAG 1620 

AAGAAGCTTA TGAAAASPTT TOTITCGAAT CAGAGCCTAA GCGAAAACAG TAAACTEAAA 1680 

TTTCAGCATA TCAGmTAT AAAGCACnTT AGGTATGGrTG ATTTAGCAGA ACACAAGAGA 1740 

GCAAGAAAAT GTGTCACATC TATACCAAAT raAGGATCTT GAGT:TATOrr ACTAATOTAT 1800 

GCAACTTTAA TTITGTTrAA CACTATCTGC CAAAATAAAC •TTTATTCCXT ATAACITAAA 1860 

ATGTGTATAT ATATATAATA GTITATTATG TACAGrTTAAT TCTACTOrTT TOGCTGCAAT 1920 

AAAATC6ATT TTGAAATAAA TGAAAICTTG AAAATrTTGC TAGTTOOTTA GATCCTTATC 1980 

CTTTAAATTC TACTTTTCTT GAGGGGAAAA AGTCTTOCSTC TGGAAATACA TATTACTCCA 2040 

AAAATGTAGC ATCCTTTTTT AGGTAGGAGT ATTATAGCTT YCATPITAGT TKGACATITA 2100 

C?K3T0CCAAT GAAOTCAATT TCAAATATGA ATCATAATCT TCAAAATCTT TAGCACTAAA 2160 

GTCTTGGGAA TATATCAACA ACTGATTTAC ATATOCAGAT GCTATTTCNA TACCAAGGGC 2220 

imTAAATG TCATGGOGGG GAAAAACOCA ACTTOGIGGA ACTCCCAGCT AAACAAOCAA 2280

GACETCACTG GAAGATITAT TCCAATICTA GGAAnTHTC TnTTrAITr OTATmnC 2340

AACTC3lCrAA CTTCATTACC TTAAAGOCTA GAACA^TATT dGCTITATT TATAIGGCIT 2400

TCTCACmT ATmGTAGC AKGOGTOSCA TCGACnTTr TftCTAGAGAA TTTPACEAGA 2460

TAirrGTCAT TCAAGTmc AOCTGCITTA TAATTGATAC ACCTIGAGGG TCACimCT 2520

AATACTTPTA CTATAATGTG GTACCACCTC AGCOCTAATA AATAATATTT ITACCIAATC 2580

TCAAATCITr TTCCAGCTAA CTAAAAACTO TGTACAAAAG GATOCTICT AAATAIGCAT 2640

GTAAATAffTT CIGTTAATAA COCACICTTr TACATITOGT ACATCIX7ICT CTOCTAATAC 2700

GACTTTGAAA 2760

ATAAAGCTTA AAAAAAAAAA AAAAAAAAAA AAAAACTCGA G 2801

(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1631 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

ATGGAGCCAA AGACAATCAC TCATCSCTTTC QCTTCTAGTA TAATTAAGAG ICTGCTOCCT 60
AATmCTTC CATACAATCT CATC3CTCTAC AGTQATOCIC CftGIGAGTCA ACTOTCCCTC 120
GAGCTGCrrC TC3CTTCAC3C3T TGrPCITOCCA GCATTACTCG AACAGGGACA CACGAGGCAG 180
TC3GCTQAAGG GGCTCGTGOG AGOGTGGACT GTGAOCGCCG GATACrTCCT GGATCITCAT 240
TCITATTrAT TOC3GAGACCA GGAAGAAAAT GAAAACACTTC CAAATCAACA AOTTAACAAT 300
AATCAGCATG CTCGAAATAA CAAOGCTATT OCTGrctTIGG GAGAAGGCCT TCATCCAGCC 360
CACCAAGCCA TACTOCAGCA GGGflGGGCCT CrTTOGmTC AGCYTTACCG CCGACCTTTA 420
AMTITOCAC TCAGGATATT TCTGriTCA3T GTCTTCAIOT GTATAACATT ACTCAntSCC 480
AGCCTCATCT GCCTTACITr ACCAGTATIT GCTOGCCCOT GGTTAATOTC OTPTIGGACG 540
QC3GACroCCA AAATCCATCA GCTCTACACA GCTGCTTCTC; CmrTCTATCT TOXTCGCrA 600
^'^^'««5G CTCTGAOGCT GAT^^ 660
CAGAAGGTTA AAGACTGGTC TCTCATGATC AIGAAGACTT OGATAGITCC GGTOCraiTC 720
GCTCGAGTTG TCCCTCTCCT TCTGGGGCPC CTCTITCAGC TGGTCATICT GGCTCCCCIG 780
AGC3C7ITOCCT TQGATCAGAC TOCTCTTITr TATCCATCGC AGGACTCGGC ACTIGGACm: 840
CTGCATC3CCA AAATCATTCC AGCTATAACA TTCMtSGCnc CTCAffTOCno OITCAAAACr 900
G^TAArn^ AGGrrrAocxr aaaioscatc cggaaovitc acx^cta TATitTrrror 960
AAACTOGCAG CTCCCGTCAT CTCIXTKSCTG TTGCmtXC TO^CTACC TTATOTCATA 1020
GCTXCTOGTC ritTITCCPrr ACTAGGTGTr ACTGCX3GAAA TCCAAAACTT AGTCCATCGG 1080
CGGATTTATC CATnTTACT GAIX3GTCCTG GTATTCAICG CAATnTOIC CTTCCAWm: 1140
CGCCAGTrrA AGOGOCTTTA TGAACATATP AAAAA3GACA ACTACdTOT GGGICAACGA 1200
CiroreAACT AOSAACGQAA AICTGGCAAA CAAGGCTCAT CTCCACCACC 'TCCACACSTCA 1260
TCXXIAAGAAT AAACTAGrTTG TCTCAACAAC TTGACCITCC CCTOTACATC TCCTTi™ 1320
T«3AcrrcTC TcmosAGA rmTcxr^ TGATxiTCrcA Gcciromr -ru^aaat 1380
GTA-TPPGACT TaKTrTCTCA GCATTCAGAG AGCAGCGCJTO TAAGATTCTC CTOmrTCCC 1440
TQGATCTTCT GACArTACTO OTHCTGAGA TITCTATATO TOTAAAlftCA AGPTCCTTOA 1500
TACCCTAAAA CCTTOGATTA AACAGAATGT GCATTCTACA TCTTTAAACA AAATOTATAT 1560
TAATOTATTA AATCTAGTTG TCACTITAAA AAAAAAAAAA AAAAAACTOG AGGGGGGCCC 1620
GGrrACCCAAA T 1631

(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

CCGAGCTGGG OGAGAAGTAG GQGAGGGCAC GAGCXXXTGC G^TOGCGCTIT GCTATCGCIT 60
CGCAGAACCT ACTCAOGCAG CCAGCTGAGA AGAGTTCAGG GAAAGTOCTC CTCCTOGCnC 120
TOCAGAOGOG ATGGATAAOG TGCAGCOGAA AATAAAflCAT CQCCCCTTCT GCnCAGICT 180
GAAAGGOCAC GIGAAGATGC TGOGGCTGGA TATTATIIAAC TCACTGCTAA CAACAOTATT 240
CATGCTCATC GIATCTGTGT TQGCACTGAT ACCAGAAACC ACAACAITCA CACnTCCJTCG 300
AGGGGnOTPT GCACTPGTGA CAGCAGTATG OT?ICITCCC GAOGGGGCCC TTATITACCG 360
GAAGCnCTG TTCAATCOCA GCGGrTOCTTA OCAGAAAAAG CCTtTTOCATC AAAAAAAAGA 420
AGTTTTGTAA TTTTATAITA CrmTAGlT TGATACTAAG TATTAAACAT ATITCTCTAT 480
TCTTCCAAAA AAAAAAAAAA AAAA

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1416 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

CCTGCCGGrr CCACAGGGTA CAGTTAGGAC TTGft^nxnTr CTmTClCT rTTCACTTOG 
'TGAGIGAGTG ATAOGGTAAC ATGGGCCITC AGGATCACCC CTTOGAACTC IX^CCGAGm: 
CTTAAATCTC AGCTGGGATC CTGGACCTGG GAGGCCCCTG lOAGGGCCAG CTOTOGAAAA 
ACCTC3GGA<?r TGATCCCXX5A CXTOTOGAAG AACTCIGCrc GAGGGCAGGG TOCCCTOGAA 

CICTTCAGCA GTOZAACTAC TICTTCTCAA AACAmTCT AATrGATTOG TAGCmnCA 
TAAGCAOTGr TTCTTTAAGG CAOOSAAAGG GAAGAA'TOCT CAAGCAAGTC ATOmCTTr 

TCAGTGGGAT GGGCCCGCGT TCTCACTGCT GGGGGCITCC COTTCATOTO GCACCTTICT 

GCAGGGGCCA CXIAOGCAGAC TCTTCCCACC T^ 

AOOGTAAITT GGCTAATCAG AGGCAOTrTT TITCTCCrAG TAICTITCAC ACTIGTCCAA 
COGTCTTArr rrTTTAAAAG TOTOETOCT TGTATrAACA OGAAACTAGA GAGAAATAOT 
rrCTGAAGCC AOTITATO3T GAAGATCCCC AAGGGGAGOT TOSOTAGAGA AAAATACTAA 
«rrGGTTrAG AAACTGACGA GQGCAAACAG OCAGGAOGCA TICGAGAGGA ATTO5CCAAA 
GATCTAOOCT GAGATAACGC CTGTCCAGrTG TCTTCACCAC CHGAATAACC AGOGCTCCAA 

AGrrGTmrc toctotgaaa aaaaaaaitc cacaagotpt taaagcstcca nrAAGAArc 

CATGTGACTT TAGAATOGAA CrGCXX3GCCX: TGGCAACICT CACGiaKSCT AGAAGCTTOG 
ATGCCTCTGG AATGCATGTG ATACTCATCT CCATmOTT TCOTTOAITC CATmTOIT 
CTPITAGCAG ATCTGTCCCT GTCGGTOGTC TCTAAGAACT CGGACACCTT GOTmTCTC 
WAGAITGAG CTOQGCAGCT GCAATCAGCr ICTITATATC CAAATTAGGC ACGACCCATC 
TCrCGOTTCaV GGTTGCTGGC TAATGAAGTG AGGGGAGGGA GGGATCTCAC CCCAAAA(?m 
GCXXXTCCCA TTGGCTTrGG CCAGGCCAGA CACTTCACAT CGnTTACATC C?rKTOlX?EA 
ATTTTAAACT TTATGTGTAT AAAGCGAAGC TGTITCTOre AAACTOTATA TTITCTAAAT 
AAATATATT6 CTACTTGAAA AAAAAAAAAA AAAAAA 

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2847 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

GC3CTAC3GACA ATTPrGGTCC TTTACCTATC TCTX3CAAAGA CTOGAGAAIT TOGCATACCA 

TTAATTACAA CCACCAATCA TATCCAACAA AAGTACCCTA AAAGAAGGAC CAffTOGCCAC 

TCTCGAAAAA ATITAAC?rAT CAGAAGATTA AAAAGATPIT AGGAITTCGA AGCTTOTATT 

CTTCriTCCCC AATAATCATT GTTTCATCTC CAAATAGTAG CCrTATAITA GCAATRGACA 

GATCATTGGT TCTCCATATC TGATCATATG TPACTACTrr GGAATCACTA TTTOGGCAAA 

TTCAAGCATT TATGCAGTGG ATATAAATGG AAATATAAAA ATATtTCCCA ACCTOTCTCA 

GTAACTTATC ATATCTCTCT GNATCCTCAA GGAAAGCACT TITCCTITTA CTTAGAAAGC 

GmrCAGATT TGCTTTATAG ACPOCTQCTG TCTTCACTAC CIGATAAAAC TTTAACCAGG 

GMGCAITAA ACACAGTGCA GCAGCTTTTG CCCAGGCITC TAACTTCCTC CCGGCAGCAT 

TTATCAATGT AAGAACTAGG ATGCTTOCTG CAGTCGCACT ACCITCCCCT AGAGCTOGAG 

CATGCTGCTT GGCCTTAAGC CCCAGCATGA TGAGGCTTCC CTCCTCCCAG CTCACTAAAA 

CTTAGAGAGC TCAGAATTGG GTCTTGOCTG GGTGCAGCTTC GCflGGOTITC CTCAAACCCC 

TAAAGAGAAG TCACCAAGGG AGGCAGGTAA TGAATCTTTC CAGAATCAGT CKGftTACTCA 

TAGCAATTTC TGGCTATCIT TCAAATCTTG AATTOTOSA TCCTCAGAGG GACTTI^ 

TQATATCATT AAATCCfiGGA CAGTCCCAAG AACTGCTTCG ACHClCGGCr CTCIACAGCCC 

AAGAAGGGAA ATAACTTGTA TTAAGGAACA ACTATGAGCC AGGCCCTCAG CTOTCTCTTA 

GATAATAAAA CAGATGGGGA GTGGAAGAGT CATTK3CTTC AAOTTATACA GCTAGGAAAT 

ACTCAAfiCCA AATCTTGAAC GCAGCTCCCC CTAATTCTOT GGACAGGCAC TTICTACCAC 

ACACCATQGT CCACCTAAAA ACAGAAGGAT AAAAAGACTT CAGGITITO: CACTCK?rcC 

TGACCATCCC AATTTATGAA TCTTCTTCAA AATGACATTT CACACTTATA CTTTAGGGCIC 

AGAAATGGCA TTGftGC?rAGC CTTAaTTCTC OCCTPTAGCA GATOCTTTAA CTACACATIG 

CTQACTTGAG CCCACCCCCA GGAGTTAGGA GAACATITCC TmTCATCC CATCTTCCAT 

AAATAAGGTG rTTCTPOGCC ITCAAAGATA TAGAACTTTG CAGCAGrA(?r AAAAGTCAAG 
GtHOTPCTGC TCICTACTCA ACTTTATITG AAAATC5TCTG CSGCTTCACT CCTOEAGAAA
AGGAAATCTT CATATTTrAG TAAACTTAGC asCCAGTICTA CTCTCTCAGG ATOTOCSCAAT 
'TCAAAGTCCA GTCAATCTOG CTCTCPTACT GATTOCTOGT 1TrACTX5TOr GKTTCGGGGG 
AGnCTGTACC TATATATAAA GCSACAAGICT GATATCTOTC TATATOEATA TACA-mCATA 
CATOTCCACA CACACACACA CAATATTTGA GAGCTAAGGA AAACTCAAAG CAGCCCCTTC 
ATTATCmSC GTACTTACTTC AAAGATITCT GTCAGCCCTA A1TACAACT3 TCACCAa-ATA 
GTTGX3QGCTT AGGTACTTGC TTACAGGAAG AGCAATTCCC TAGCAAAGCT CATTAGCTCC 
TAAGGCACTG AGTCAAAGTC ACAGCCCTGA AGGAAATTCC ACTCCAGCC!C TCCTOMGA 
TOTCTAATAA GATGQGAAAC TTGGATOCCX: ACSCCAnTTO GTCACCTCAG AGTCTAACTA 
CrcCAGTTAG ACCTAAGGC3C ACAAATOCAG AATTCATCAC CTOTAGrTO TCGCAGGCTTC 
TAGGAAGTCC TCTCTCCXXZA AGTAGAAAAT AITCTCTTOC GATTCCTCAA AITCCACAIT 
CATAXAATCG CICTGCAATA CATGCTTCTC AATAAGAAAA TTAACIGCAT GITIACTCTC 
TGCTGATCAC ATCAGATITT TATOTTTAAA AAAATCTCAT TATGCaJTTCA GTCCAGCCCA 
C3CTCTAAGAG AAAAAGAAC3G CXX:ATATGGG AGACTTCACT CrcATOATTA -ETCCCTTrAT 
CCAGCAGTCC TTATRAAGCC CCCTACC?CTG TCCCATTCCA GAAACCATAA GACTCACSCSCA 
CTrCTTGATr CTGGAGGCCT GCXnxXJTAAG ATAAGATAGT ATAATTK3GA ACTCAGAACA 
TACCAGAAAC AGCAGAACGA GOGCCAGAGC AGAAAAA3X3A AAATAACTTCG AGACACTTAT 
GGATACATTG GTGCAAAAAA AGCCACGGGS CCCATACTOG QCTTCATATC ACPITCAGGG 
GACAGCAGAT TAATACTTAA TGAGGGITAA ACCTGACCAG TCTITCTACA GTCACAGGCC 
ACACTGCATG AATOQGGAGA ACCAATGAAT CCATTGTCCT CTGCCTAnT TCCTOTCCAC 
AGTCACATTC CCTCCITAGG AATCITCCCC TrCCACCCTT TACATTAAAC AAGGGAACAC 
TGAATCTTTC AAGQGAATTA CAOGTTTGGG rrAATOnTC ACTATATCAT TTTCATACTC 
TAAATTATTT TGTAAGAGAG ATTEACTCCT ATCCCAGGAT CTTCGGACTT GtHGCCCCTC 
TC3CATITOGA AATCAATAAA CTATTACrGG AAATGCCAAA AAAAAAAAAA AAAAAAAAAN 
NAAAAAACTC GAGGC3GGGCC CGTACCC 
(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1394 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

GAGATTQCSTC GAGGAGAGTA AATAATCTAG AGGCAAGAGT TCflGTCAGQG CXIAAGGGGGA 60

CCCCCAGAAA AAGC3TATGGA GCTAACTCAT CTCTTrTACA AGGCSGTOGCC MtiACTEACT 120 

GITCCAAAGT ACTCAGTCTA TATTTAATGT TGATTCTTGA ATTTTAOTPA CGAGAGQGAA 180 

GAACAATTTT ACTTCTOTCC TTATTTCACT TGCTQAAAAG CTCTCC5C3ACA AAATCTATOG 240

AATAGACAAG GCCACTTTCT TTGTGATTTC TGCTTTTCAT GCATATTATT TTATITACCX: 300 

ATAATTTCCA AGAGC?rTO3G CCTITCCGCTC TCCTGCmT TTCTITCATC CACCCCTTTC 360 

CTTTTTITOG AAGGCSOGrTTA TATATGAGAG TTCAITGAAG AAdtTAGTC AGGCTCAAGT 420 

AAAGGQGCAA GATAGGGCAG TTAACTAAAG AGCACTTTAT TTCTTTGAAG CCTPTCTAAG 480 

AAAGAAATGG GQC7P0CGACT GC3CTTGAATC TCCCATGA3G TTOGAGGGCA CrrACTX3GGG 540

TTGAAGTATG ACATAATATT TCCCATTGGG GAAAGGAGAA TTTCTCrTAG AGC5(?K30CAA 600 

AATGCCTTTG CCCAGTOTCC CTATnTAGG CATCTTTTCC TTCCTTArrc CTTCCACTTCA 660 

GC3C3TGPGTCC TATACAAAAC TTCCCATCAG TICTCCTCAA TATTCCCCAT TTCTAAAIGA 720 

TCACTTCTCT TTTCTAAACC CTTrrCCTCT TCAGATCCAT ACAGGATITC CAAGQCTAGG 780 

ATCATACATG CAAATCCCCC TrGTTCATCT GTIGTCTTCTG CAAACTAGTC TCATCAAGAA 840

TTCTGGOGrrG CAGCAGGGTA GCTGAAGrTTT QGGTCTQGGA CTOGAGATTC GCCATTAGGC 900 

OTCNCTGAGA TTCCAGCTCC CTTCCACCAA GCCCAGTCTT GCTACGTCGC ACAGGC3CAAA 960 

CCTGACTCCC TTTGGGCCTC AGTTTCCCCT (XCCTTCATG AAATCAAAAG AATACTACIT 1020 

TTPCTTGriTC GTCTAGCA3T GCTC3GACACA AAGTGTAGTC ATTAfTTCTTC TATTCGCnGA 1080 

TGTOTGCAAA ACTGCAGAAG CTGACTQCCT ATAAGAGGAA ATAAGAGAGA AAGIQGAQGA 1140

GAGGGACAAA AGGAGTAATT ATTrGGTATA GATCCACCCA TOXAACCTT TCTTCTCCTCA 1200 

GTCCCTGCTC CTCATGTITC TGGTTTGGTG AGTCCTTTCT GCXACCACCC ATAATGCTTT 1260 

GCATTGCTGC ATCCTGGGAA GGGQGTATAT GGTCTCACAA GTTCITOTCA T TC TmTlT 1320 

GCATGCTTTC TTAATAAAAA AAAAAAAAAA ATOTTTANAG TITrAICrTA AAAAAAAAAA 1380 

AAAAAAAAAA hCCC 1394

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 794 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:
G(31ROGAGC3C GGACTAAAGG GACTTGAGCG AGCCAGTTCC CGGATKOTC TATITCCCCT 60
CCCTCTCTCC CQCCCCCTAT CTCTTTTCAC CCTTCPOCCA CCCTOGCTCG CCTACCAIGG 120 
CQGAQCGTCG GCGOCCACTC AGTCCCAITC CATCTCCTCG TCGTCCITCG GAGCCGAGCC 
<nXXXXXXrC C3CXX3GCGGro 

CAGCTCCTCC TGTGOC3GTC3C AOGA3CTaAT TITCTOGAGA GATOTCAAGA AGACTCGCTT 
TCTCTTTGGA CACGCTGATC ATGCTGCTIT CCCTCGCAGC TETCAGKnc A3CAKPCTGG
GTTTCTTAMC TCATCCTGGC TCTTCTCTCT GTCACCATCA RCTTCAGGAT CTACAACTTCC 



CTTCATCCAAG CTOIWCAGAA RTCAGAARAA GGCCATCCAW TCCAAAQCCT ACCTOGACCT 480

AGACATTACT CTGTCCTCAG AAGCTTTCCA TAATTACATC AATOCTOCCA TOGTOCACAT 
CAACAGGGCC CTGAAACTCA TTATTCGTCT CTTTCTOGTA GAAGAaCTCG TTCACTCCTT 
GAAGCTQGCT GTCTTCATGT GGCPGATGAC CTATGTTQGT GCTarFTrrA ACGGAATCAC 660
CCTTCTAATT CTTGCTGAAC TGCTCATTIT CAGTGTCCCG ATTOTCTATC AGAAOTACAA 720 
GACCCAGATT GATCACTATG TO3GCATCGC CCGAGATCAG ACCAAffTCAA TOTTTCAAAA 780 
GATCCCAAGC AAAA 

(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1544 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

TTTGCTTGCT AGTCTGAACC AAAGAGTTGT TTOGGCATTT GCTOTGTTCG CCKPTPCTOG 60 

AGCAAGAGGG TCTTCITCCT CCITCCCCCA GCCAC3CCAGC TCTCCTOGGG CCAGGCnTC 120 

CTGOCyrOGAA AGAAGTATAC CTrTCCCTGG GGCCCTAGGA TAGCAAAGTC AGCCATAGK? 180 

C5GCCAGQCTG CCCTCCATQC TGOGCCCCAG CCCAGGTCTC CACTCGCCTO GATCACCTTC 240 

TTTGAGCCTT AOCCATCTCC TGTCAGGTAG GAATGAACTT GCCAGCCTTC AGGYTCCmC 300 

AGCTATGACC ATCTGTGCGG TCAGGGTACA CTCAGCTCTC CTCCCCAACT CCAGCAGCCT 360 

TTAAGAAGTG TCCCnTCGC GCCCCCTC5GA GGCAGAGCAC ItSAGCTCGAC CCTCGCTTAGA 420 

CTCOCACAGG GAGGACGGAG CTGGCCTCAG GAGTQGGACA CCCAGACITC GCAGGGCCTT 480 



WO 98/39448    PCT/US98/04493

CAAfiflGGCCT GTGTOGGQGC CCCAGGAATC CTTAGCTGAA GCGGGGAGAC TCACTCTCCA 540 

TCTCAGGAAA TTCTAGOCCT TQCX3CTCAGG GAGCCACOC3T TGAGGGTGM; GCCCAACACC 600 

TGCXTTTAGGG CTCTOGOPQG GCAftSTCTGG GCCCTQGQC?r AQGGAGGGAG ACTCAGGCCC 660 

ACACITOGC5T ATTTTCTAAT TTCftGACAAA CACACACTCA GOGOtSCACTC ACTCATTCCT 720 

ACACATTGCC AAGATTTCAC ACATGTCACC AOGGGCCACr AAAGTCCCTC TCACCTTTCT 780 

GACTAGGATC CTAATTTCTC TATTTTCTCC TOGGTQCCTG GGTCTCTCTC ACCTOGGGCA 840 

GTCTGGATAA TGTITAGTPC TGTGACACTG TTTTTTGOGG GTCGCACCTC GTTCTCCGAT 900 

GOCTGGGCTG GTCTCAGGCC CAGGACTCTA C3TC3CTCGGAG CAGTAAAGCT CAGCTCTOIG 960 

TAATCaOTGA TGCTATQGCT TGCTCGTCTC TTATGATCCA ATCCTTTTCT ACATCAGCCC 1020 

TTGrmTGTT TTATQGCTAG TCTEATCTQG CCTOGTTATT TCCTTQOGGG GAGGAGAGGG 1080 

TTrcCTAATC TGCTCCCAGC CXAACCTATT ACCACCCCAC CTCGCTCGGA CCTACTOCTC 1140 

GGGAQGCAGC AGACAGGC3AG CCACCAGCAG TOGCTTCCTG GCCCTCTOCT GGGGCTTOGGG 1200 

GGAAGCTOGG GGCACATCTTG GCXXHTGCXT TCTGAGCAGC TCCCAGTOCC AGGGOTPTCA 1260 

GACTTTCCCA CATGATAAAA GAAAAGQGAG GTACAGAAGT TCCAATTCCC TnTTAmT 1320 - 

OCTPGGrrrGGT ATCTCTAAAT GTTTAATAAA TATCTGAGCA TGTATCTATC AACGCCAAGA 1380 

ATTTCAAAGT CTCCTTCAAC AATATGAGCXT TTTTAGGATG TTTATATTCC TTCATCCCIC 1440 

TTC3TTTCCCA GGrmTOCAG GGAAAAAAAG TCTGGAATTA TAGATACAGC TTATTATTAA 1500 

ATTTGrrrCTT GCATAAAAAA AAAAAAAAAA AACNCNJiGGG QGGG 1544
(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:
ACCCACGCGT CCGNCTTGrC CACCCGGGGG CGTCSGGAGrPG AGGTACCAGA TTCAGCCCAT 
TTQGOCCCGA CGCCTCTGTT CTCGGAATCC GGGrTGCTGCG GATTCAGGTC CCGCTrTCCTA 
AGGTGGGTCG CTGTCCACCC GGGQGCGTGG GAGTCAGGTA CCAGATICAG CCCATITCGC 
CCCGACGCCT CTGTTCTCGG AATCCGGGTG CTGOOGATTG AGCTTCCCGGT TCCTAAOGGA 
CTGCAAGATG GAGGAAGGCG GGAACCTAGG AGGCCTGATT AAGATGCTTCC ATCTACTCGT 
CTTCTCAGGT GCXHTOGGCA TC3CAAATGTC GGrreACCITC GTCTCAGGCT VICCICCm
-rcCGAAGCCT TCCCCGACAT ACCITCCaGAC TACTGCflGAG CAAACICTIC CCCTTCTACT
TCCACATCTC CATQC3GCTGT GCCTTCATCA ACCICTQCAT CTTOXTTTCA CAGCATCCTr
GGGCPCAC5CT CACATTCTC3G GAGGCCAGCC AGCTTTACCT GCTXTTTCCTC AGCCTTACGC
TGGCCftCTGT CAAOSCOCGC TGCXTOGAAC OCOGCAOCAC AGCTOCCATC TC5GGCCCTCC 600
AAACCGTC3GG AGAAGGAGCG AGGCXTO3GT GC3GGAGGTAC CAGGCAGCCA ACAOmcOC 660
GATCCTTAAC GCCAiMGCG AGAGAAGGAC CCCAAGTACA GTOCTCTCCG CCAGAATTTC 720
1TCCGCTACC ATGQC3CTGTC CTCTCTTPGC AATCTC5GGCT GCGTCCTCAG CAATOGGCTC
TGTCTCGCTG GCCITCCCCT GGAAATAAGG AGCCTCTAGC AJXSGGCCCTC CATOCTAATA
AATC3CTTCTT CAGAAAAAAA AAAAAAAAAA A 871

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 404 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:
GGCAOGAGTT ATAGCATOC5C ATPCATACTT TTGTTITATT C3CCTCATCAC TXTmGACT 
TTAGAACAAA ACAGTGCAAC CCTAGAGCCT TCTTCCCATC AAATITTOCA TCTOCTCCAA 
AACTCXTTTTG AGTTACTCAG AACTTCAACC TCCCAATGCA CTGAAOGCAT TCCOTOPCAA 180 
AGATACCAGA ATC3GGTTACA CAITTAACCT GCSCAAACATr GAAGAACICT TAAICTITO: 240
TTTTTAATAA GAATGACGCC CCACTTTOGG GACTAAAATT GTOCTATTCC CGAGAACCAG 300 
TCTAAAATTT ATnTITrAA AAAGAGAAAC TGCCCCATTA TTTTCCSTOCX; Gril^ l T r i T 360
AA3TTNTAAT IfTGAAAAATT TTTTTGGCSGT TrTTCOGCSCC ATOG 404 



(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1542 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:
GTC»GACAGG TGCSAGCCOX: GGOXAG^

GGCTCITCGA GAGGACauSflG GflGflGCOCTG GflOAGGAGAG GCTOGAGAGT CXTTTAGCCAG 

GArocaooCT GirorGAACT rorACCAAca ostgatgaag CACGCwaTC ccogga-tcca 180

GQGCTACCCT CTGATOQOT CC0CCTO3CT AATGACCICC ATTCTCCTCA CCTACGIGEA 240 
CTrCGTTCTC TCACITOQGC CTCCSCATCAT QOCTAATCGG AAGCCCITCC AGCTCCGTOG 300 
CETCATGATT GTCTACAACT TCTCACTQGT QC5CACTCTCC CTCTACATK5 TCTATOAGTr 360 
CCTCAICTCG C3C3CTQC3CTGA GCAOCTATAC CTGQCQCTOT GftaXTOTOG ACTATTCCAA 
CAGCXXTGAG GCACTTAGGA TQCTTCGQGT GGCCTOCXTC TTCCICTrcT CXaAGITCAT
TOAGCKaTG CaCACAGTCA TCm-AlTCT

ACATOICTTC CATCACTCTG TQCTTCCCTG GAQCTGGIQG TOQGGQCTAA AGATTCCCCr 600 

GGGAGGAATG GC3CTCTTrcC ATGCXaVTGAT AAACTCTTCC GTCCATOTCA TAAICTACCT 660 

GTACTAOGGA TTATCTOCCT TTGCMCCTOr GGCACAACCX: ■CACCTITCCT GGAAAAAGCA 720 

CATtSACAGCC ATTCAQCroA TCCAGnTGT CCTOGTHTOV CTOCACAICT CCCACTACTA 780

crrrATCTCc agctotaact accajstaccc AGrcAn-A-n' cacctcatct ggatotatog 840 

CACCATCTTC 1TCATCCTGT TCTOCAACTT CTOCTATCAC TCTTATAOCA AGGGCAAGOG 900 
GCTCCCCrGT GCACTTCAGC AAAATGGAGC TCCAGGTAOT GCCAACGICA AGGCCAACTC 960 
«3AA0CATGG CCTAGATAGG CGCXXftCCTA ACTQCCTCAG GACTCCACCT TAGGGCACTG 
TOCGTCAGTG OCCTCTCCAC CTACACCTOT GACCftAGGCT TATOTOaTCA GGACTOAGCA

C3GGGACTCGC CCTCOOCTOC CCACAOCTQC TCTACAGGGA CXaOMCTIT GGnCClCAC 1140 
CCACTKXXX: CQGQCAGCTC CAGGGATGTG GCCTCAITCC TCTCIGOCAC TCCAGAGCTC 1200
GGQGCTAAAA GGGCICTACA GTTATTTCCC CCTCCCTGCC TTAAAACTIG GGAGAGGAGC 1260 
ACTCAGGGCT GGCCCXaCAA AQGGTCTOCT GGCCTTTITC CTCACACAGA AGAGCTCAGC 1320 
AATAATGTCA CIGTCGADCC AGTCTCACTC CKXaOCCCA CACACTCAAG CACTaGCTTC
IGGGCCAAAG GTCAQGGrTGG GOGGQQGOCT GGGAATACAG CCTCTOGAGG CTOCITACTC 
AACTTCItJIC TTAA3TAAAA GTCACAGAGG AAAOCANAAA AAAAAAAAAA AAAAACTCGA 1500
(XCGCGCCXG TACCCAAATC GCCQGTATGA TCGTAAACAA TC 1543 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2327 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

GGTAGCTCAN TGCAGTGAAA TAGTCTTACT GGAAACAAAG CCCTTTAICA AGAATAAaTA 60 

ACTCTTCCCT TITCTTTTTG GAGaGGTGCT TTGTTTCTGA TCGGACCArr TCACTCCAGC 120 

AAC5CAACACA CTATTCTOAG CAGAAGATCG GCSACTTGAGG CCATOTIGCG GAQGGCCRCT 180 

RACATTATCT CX3ACTCTC3GA GTOIGAGGAA TAIGGACTCC ACTCITCACT ATATTCACAR 240 

CGATTCAGAC TIGAGCAACA ATAGCAGm 300 

ACAAGATGTT GTACCTCAGG CGTTGTTAGA TCAGTATTrA TCTATCACTC ACOCTTCTCG 360 

TGCACAGACG GTTGACACTG AAATTOCTAA GCACPGTGCA TATAGCCTCC CTOCTGTCGC 420 

CTTGACACTC GGAAGACAGA ATTGGCACTG CXTTCSAGAGAG ACCTATCRGA CTYTOGCCTC 480 

AGACATGCAG TGGAAAGnTC GADGGAACTC TAGCATTCTC CATCCACGRG CTTOCACTTA 540 

TTCTTGGAGA TCAATTGACA GCTCCAGATC TGCnTCCAAT TrTTAATCGA TmTAAAAG 600 

AOCTCGATGA AGTCAGGATA GGTCTTCrTA AACACTTGCA TCATTrTCTC AAGCTTCTIC 660 

ATATTGACAA AAGAAGAGAA TATCTTTATC AACTTCAGGA CTITITOSTG ACflGATAATA 720 

GTAGAAKITC GCGGTITCQA GCTCAACTGG CTC3AACAGCT GATTTTACTT CTAGAOTTAT 780 

ATAGTCCCAG AGATGTITAT GACTATTTAC GTCCCATTGC TCTGAATCTG 'TOTOCAGACA 840 

AAGnrcrPc TGrrroGTroG aottcctaca agttqgtcag cgagatogtc aagaagctcc 900 

AOSCGQCAAC ACCACCAACG TTCQGAfiTOG ACCTCATCAA TSAQCTTCTC GAGAACTITC 960 

GCAGATGTCC CAAGrrGGTCT GGTCGGCAAG CCTTTGTCTT TGTCTOCCfiG ACTCTCATTG 1020 

AGGATQACPG CXTTCCCATG GACCAGTITG CTGTGCATCT CATQCCGCAT CTCCTAACCT 1080 

TAGCAAATGA CAGGGITCCT AAOGTOOGAG TCCTOCITGC AAAGACATTA AGACAAACTC 1140 

TACTAGAAAA AGACTATTTC rrOQCCTCTG CCAGCTGCCA CCAGGAQGCT GTOGAGCAGA 1200 

CCATCAITOC TCTTCAGATC GftCCGTGACA GCGATGTCAA GrrATTTTGCA AGCAICCACC 1260 

CTC5CCAGTAC CAAAATCTCC GAAGATGCCA TGAGCACAGC GTrCCTCAACC TACTAGAAGG 1320 

CITCAATCTC GGrrcrCTTTC CTGCTTCCAT GAGAGCCGAG GTTCAGTOGG CMTCGOCAC 1380 

GCATGTCACC TGGGATAGCT TTCGGGGGAG GAGAGACCTT CCTCTCCTCC GGACTTCATT 1440 

GCAGGTOCAA GTTGCCTACA CCCAATACCA GGGATTTCAA GAGTCAAGAG AAACTACA(?r 1500 

AAACACTATT ATCTTATCTT GACTTTAAKG KKWAWKMMWW KTCAGMSRA rrATAMTTSW 1560 

CTOIMRARGEM WyMAAWSCTK SMSCTCYWCX: KSRSTCTMKG MMRCTCTAGA AYTRGYRGAK 1620 

CWyYYKSGCT KMftlGGAAKKS GGCASGAGCC AGAGACCTGC ATIGCTrrCT CCTOCTriTA 1680 
rrTAflCAATC GACAAA3GAA ATTCTTACAG OCTCftAGtXA GAa?IX?roOC CAGATOIGAA 1740

MaCACCTTC ACTMCflGCC CTAACTCITC TCTOCCflGGa AaaCTTOCT CXJGCICTOIG 1800 

GCCflGCTCTC CAGCrtaOCX: CTOTOTGTCA ATCGrTTOTC ACCTGTCCAA KTOCXSAAMG 1860 

«3GC3GTTTrT ACATCTCCTA AMGACCTC5A TGCXAACACA ACTAXSOTC ACTTAAACTC 1920 

TTAAGOCSCAG CA3MTOCTG TACACATTTA CACSAATOGTT GCTCflaTOTC TOKJTCTOAT 1980 

TrmCATCC TKJICMGftC CTGAAGC5AAA TTTAITAGAC CTATAA-roTA TOTCTCCTCT 2040 

TITrAACITC ATCAaxaVIta GCTCTGftBGT GCAACTTCTT CACATftClCT ACATACCICT 2100 

GACCACTCTT GGGAGTOCTG CAQTCTOrAA TCAatGCTGTT TAAACTCTTO TCGCftCAflCT 2160 

TCTCTTCTCC AAATAAAATT TATTAATAAG ATCTATAGAG AGAGATATAT ACACTTnGA 2220 

irorrrrcTA gatgtctacc aataaatgca atttctgacc tctaaaaaaa aaawaaaaaa 

ACTCGAGC30G GQCCOGGTAC CCAAATCGCC GATATGATCT AAMCATC 
(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1062 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

GGCCGCCGAG GOGCAACAGC CGTTCTGTCA GCTCTOQGTC CAMXX5GACT AGCGAANATC 60

OTOCTCATCC TCATCATCGT CTTCCTCATC CCXaTCTCGG TCCfiGtJTCCC TCTCCCCCCC 120

ACACAAGAGG TQC3CGABGGT CCAGCTOTAG TTCCTOTGGA CXytTCTCGAA GATOCTCTTC 180

CTCTTCTTCG TCATCATCTT CXTTCTTOGTC TTCCTCATCC TCMCATCCSV CTTTCTCGAAG 240

CCXSCTCflCXSA ATCCCCSTCC CXXXX5CCGGA GRAACTGBCA GGAGQCQGCG CTACSGCTCT 300

TATCGTTCAC ATGACCATTA CXAAAGGCAA AGAGTGCTAC AAAAOGfiGCG TCCAATAGAA 360

GAAAGAAGGG TOGTCTTCAT TOGAAAGATA CXTOGCOGCA TGACIOSATC AGAQCTCAAA 420

CAGAGGTICT CCGrnTTCG AGAGATTGAG GBGTQCACXa TCCACTICCG TOTCCAAGGG 480

C3ACAACTA0G GCTTCOTCAC TTATCGCTAT GCTGAGGAGG CATTIGCAGC CATTGAGACT 540
QQCCACAAGC TGCQGCAGGC AGATGAGCAG OOTTTGATC TCTOCTITCG GGQCXGAAGG 600
SWSraCTGCA AGAGGAGCTA TTCTCATCTT GACTOCAACC QQGAAGACTT TCACCCAGCA 660
CCrcrrAAAGA GCAAATTTGA TTCTCTTGRC TITCACflCAT TGITCAAACA OSOCCAGAflG 720
AACCPCAGGA GGTAACCTTG GGCOCTTCCC TGClA'TCCTr TITCTCCTIT GGAGC?rcCCC 780
AACCTCCrcC ACCOCCTTCC CCTACTCTAG QC3C3AGAGAGC TQCTAGTCftG AIXlACTOITr 840
TATAAAGAAA TQQAAAAAAG TGAAATAAAA AATAICTTCA MCAGATPIT TTAAAAGGGG 900
TATTOmrr riTATAACAG CSTATTGAAAC AACnTAACTT GCMTCCTAT CJTAAGATAGG 960



AGC3GGCTGAG GGGftlXXTCA C3TOTTOGAA CATAAGTCAC TATOCAGACT AATAAACATC 1020 



AACTAGAGAG NAAAAAAAAA AAAAAAAAAA ATTTAAAAAA CT 1062



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

GAGAGACTCA CACTTCrTTT CCATTATCAC TQACGA3CTA CJTOGACATAG CAGQC3GAAGA 60
CXIACCTACCT GTCOTO^ 120
AGGOTPOCTG OOTATGAAG OCGATC3CAGA AATTTTOGCT GTCAAAmC ACACTATCAT 180
AACTGAGAAG TGGGGATTAA ATATGGAGTA OTOKXTTOGC CAGGCTTACA ITCWCTCTAG 240
TGGATmCT TCCAAAATGA AAGTICTrGC TTCTAGACTT TYAAGMKMRA IWKCCCCMAK 300
WAWCKGAAC AMAMKCTCSW CYTOCWSYGC SKTRRMKRYC GYKSTATRRC WARWKSAKYM 360
CarcKKKTGS RRGTAWYTSK TOCAYKAGGG AACAATKSAG GAAGnTOTP CTimTCCA 420
TOGATCACCA CAACT3CTIT TAGAACITCA CAACCTAA3T IdCTICTrr TTCAGAACAG 480
TAAAGAAAGG GGTAAAGAAC TGAAC5GAAAT CroCCATTCT CAGTOGACAG GCAGGCATCA 540
TGCTOTCAA AnTTAGTGG AACTCCTC3CA AGCACTTCTT TTATOPrTAG ATCCSrATAAA 600
TAGTGACACA AATATTAGAT GGAATAACTA TATAGCTOGC CGAGCAriTC TACTCTCACT 660
GCAGTGTCAG ATTTTGATTT CATTGITACT ATTCnTOm: TTAAAAASCT CCTATCTTIT 720
ACAAGAGCCT TTOGGAAAAA CYYCMAGGGG CAAACCTCIG ATOKTnCTT TCCKKMMSRT 780
ARMTITTGAY ATRMARYACT R^MrKSAyTY AAY?g?WOTGA CWSGAWAATA TTRAASTYTA 840
TACAATKAAT YWTRRYTTSM KRMACMfAAT OCGAAAYTOT GGMAAMYAAA CITCATATTC 900
AAAT6AAACT CCCTC3GGAAA TTCOGCAGAG C7PCACCAGGG TAACITCGAA TCTCAGCTAA 960
CCTCTQAGAG TPACTATAAA GAAACCCTAA GTCTCXXAAC AffTOGAGCAC ATrATTCAGG 1020
AACTTAAAGA TATAOTCTCA GAACAGCACC TCAAAGCICT 1080
CCICAGTCAT GGGACAACIC AAA3TCAATA CXTPOCSGAGGA ACACCATOCT GACATGTATA 1140

GAAGTGACTT ACXXAATCCT GACACGCTGT CAGCTGAGCT TCATTGTTGG AGAATCAAAT 1200 

GGAAACACAG GQGGAAAGAT ATAGAQCTTC OGTOCACCAT CTATGAAQOC CTCCACCTOC 1260 

CTGACATCAA GTITTTTCCT AATGTGTATG CATTOCTGAA GGTCCTGTGT ATTCTTCCTG 1320 

TGMGAAGGT TQAGAATGAG CX3GTATGAAA ATGGACGAAA GCGTCTTAAA GCATATTTGA 1380

GGAACACTTT GACAGAOCAA AQGTCAAGrTA ACTTOGCTTT GCTTAACATA AATTTTGATA 1440 

TAAAACACGA CCTGGATTTA ATGGTQGACA CATATATTAA ACTCTATACR AKTAMGTCAG 1500 

MQCTYYCTAC AKAYRAXTCM SWASMCTQG AAARVWSSTA MSMSWGCWKK TAMMRKIMOG 1560 

GMWWTVYYMK RKTYGAXMyW YGCGWMOGAG AAAAAGCOST AAGGrPGTATG TAGACCACTT 1620 

AATCACTAAA TATCTTTQCX: TATAGGftCTC CATTGAATAC MTAGCCATT GATAATCTAC 1680

CTGTTTAAAT GGCCXXTC3TT TGAACTCTCA AGCTTTGAAG ACXTEACXTTGr TCTTCCAGAA 1740 

GAGAAOCTTG AAACTGCCAT GTTrCCTTTT QCG?rGATCTC TGTTGATQGC ACTCTGGAAT 1800 

TGrrrrccACTr ttaaktcatt ttagacatag catttattat cactgtqgat CTCTACTTGT 1860

TGGGTOTTAT GAATTCTTTG AAGAATATAT TTTGAAGAGG TGTQGGAGGA AGGAATACAT 1920 

TTTATAAAAT GTTGTACTrGA AGCCX:ACAAT TGACCTTKGA CTAATAGGAG TTTTAAGTAT 1980

GTTAAAAATC TATACTQGAC AGTTACAAGA AATTACCGGA GAAAAGCTTG TGAGCTCACC 2040 

AAACAAGGAT TTCAGTGTAG ATnTGTCTT TCTTGAACTT AAAGAAACAA ATGACAAAGT 2100 

TTGAATOGAA AAGCXTTGCTG TTGTTCCACA TCTCGTTGCT GrPTTACAITTC CTTTGTGGAG 2160 

CCTACATCTT CCTAAGCTTT TTAGCAGGTA TATGTTGAAC ACTTCTGTTT CATQC3TTGAG 2220 

ACAGAATCAG AGGCCATQGA TACTGACAAC TGATTTGTCT LrmTlTi ' IV 'lirXXJ l trr rrr 2280

TCCATCACTC TTATATACTG CCTCATCTTG ATTTATAAGC AAAACCTGGA AAACCTACAA 2340 

AATAAGTGTT GTOGmTATC TAGAAAAATA TGGAAAATAT TGCrGTTAIT TTTGGTGAAG 2400 

AAAATCAATT TTGTATAGTT TATTTCAATC TAAATAAAAT GTGAATITTG TTWWATTAAA 2460 

AATTWQGSAC AAABTBGHQG GQGETCCAAA CKrWVTCGHG KAAMTTCTCT WAARMATYTK 2520 

ATAAACMSCT TCACAATTC 2539



(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1751 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

AGCATCSAAGC CGftTGGCCCT GGTOGCCAC3T ACCGTCCTOG GCCTOGTGCA AAACATQCCTr 60 

GOGTTTGGOG GGATCCTGGT GGTGGTCTAC TADGTATTTG OCATCATTGG GATCAACTTG 120 

TTTAGAGGCG TCATTGTC3GC TCTTCCTOGA AACAGCAGCC TGGCCCCTGC CAATGGCTCG 180 

GCQCCCTGTG GGAGCTTCGA GCAGCTGGAG TACTGGGCCA ACAACTTCGA TGACTTTGCG 240 

GCTGCCCTGG TCACTCTCTG GAACTTGATG GTOGTCAACA ACTOGCAGC3T GTTTCTQGAT 300 

GCATArrCGGC GCTACTCAGG CCCGTOGTCC AAGATCTATT TTOTATTGTO GTOQCTQGTG 360 

TCGrrcrcrcA tctgqgtcaa ccTGrrrrcTG gccctgmtc tggagaactt ccttcacaag 420 

TGGGACCCCC GCAGCCACCT GCAGCCOCTT GCTOGGACCC CAGAGGCCAC CTACCAGATG 480 

ACTGTOGAGC TCCTCTTCAG GGATATTCTG GAGGAGCCCG QGGAGGATGA GCTCACAGftG 540 

AGGCTGAGCC AC5CACCCGCA CCrGTOGCTG TGCAGGTGAC GTCOGGGCTG CCMCCCAGC 600 

AGGGGCGCXA GGAGAGAGAG GCTGGCCTAA CACAGGTGCC CATCATGGAA GftGQCGGCCA 660 

TC3CTGTGGCC AGCCAGGCAG GAAGAGACCT TTCCTCTGAC GGACCACTAA GCTOGGGACA 720 

GGAACCAAGT CCnTOCGTG TGGCCCAACA ACCATCTACA GAACAGCTCC TOGrGCTTCA 780 

GGGAGGCGCC GrTGCCCTCCG CTTTCTTTTA TAGCTGCTTC AGTGftGAATT CCCTCGrTCGA 840 

CTCCACAGGG AOCTTTCAGA CAAAAATQCA AGAAGCAGCG GCCTCCCCTG TCCCCTGCAG 900 

CTTOQGTGGT GCCTTTQCTG CCGGCAGCCC TTGGGGACCA CAGGCCTGAC CAGGGCCTOC 960 

ACAGGTTAAC CGrrGAGTCTG TCTCATCTAT TCACAGCTGG GAATGATACT AATACCTCOG 1020 

ATTTTAQCCC AGCACCACAG GGTAOyrrCC AGTTTTrCTC TCTTTCCAIA GCTGTAAGGC 1080 

CCTTTCTGGG AATQGTTCTC ATTCTCCTTA ATCTATTATT QQGTCAGTTT TCCTGCAICT 1140 

CCCCAGCCTC CCATCACTGC CACCCACTCC CCACAGAGAT GCCCTGCTCA TCCGACTGGG 1200 

GCTTTGACTC CCACACTGTG TACCCCTCTT GTGTOGftCGC OCTOCTQCCA AAACCTTCAG 1260 

CAAACAGCTT TCCAAATQGA AGTTGTCACT GTCAGGCCTT TACAATCAGC AACAGCAAAA 1320 

TCTACATGCT GCTGAGGGTC CTGCCTCATT AAGATGCAAT AAATATGTAA GTACATAAAA 1380 

ACAGCAATAG AAGAAACXTTA ATGCTTTATT CTCAAATATG ATGrTCTACAT AGAAAAGCCA 1440 

AAATTATTAA GAATAGTAAG AATTCACCCA GCACTTTGGG AGGCCGAGGC GGGTQGATCA 1500 

TGAGGTCAGG AGATCGAGAC CATCCTGGCT AACAGGGTTGA AACXXTCTCT CTACTAAAAA 1560 

TACAAAAAAT TGGCCGGC3CG CAGTOGOGGG CGCCTGTGGT CCCAGCTACT GGGGAGGCTG 1620 

AGGCAQGAGA ATGGCGrPGAA CCCQGGAAGC GGAGCTTCCA GTGAGCCGAG ATTGCGCCAC 1680 
TGCACTCCGC AGTPCCAGCCT GQGCGACAGA GCGAGACTCC GTCTCAAAAA AAAAAAAAAA 1740
AAAAAAAAAA A 1751 



(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1117 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:
AATGrrTGTGG TGGTAGCATT TGGGTTAATT CTRATTATAG AC3TCTCTTQG AGAGCAATGT 60 
CCATAAACTA ATCCXIAAACA ACATTGTCTT TTTRATGTTG TAGTGAACAG CAGflGAATTT 120
CAAAGGACCT TGCTAATATC TGTAAGACGG CAGCTACAGC AGQCATCATT GGCTtSGGTCT 180 
ATGGQGGAAT ACCAGCTTTT ATPCATGCTA AACAACAATA CATTGAGCAG AGCCAGQCflG 240 

AAATTTATCA TAAOCGGTTT GATGCTGTC3C AATCTGCACA TCGTQCTGCC ACACGAGGCT 300 
TCATTCGrTTA TGGCTGGCGC TGQGC7ITOGA GAACTGCAGT GTmCTGACT ATATTCAACA 360 
CAlTTGAACAC TAGTCTGAAT GTATACCX3AA ATAAAGATGC CTTAAGCCAT TTTGTAATTG 420
CAGGAGCTGT CACGQGAACT CTTTrTAGGA TAAACGTAGG CCTC3CGTGGC CTGGTQQCTG 480 
GTGGCATAAT TGGAGCCTTG CTGGGCACTC CTGTAGGAGG CCTGCTGATG GCA1TTCAGA 540 

AGTACTCTGG TGAGACTGTT CAGGAAAGAA AACAGAAGGA TOGAAAGGCA CTCCATGAGC 600 
TAAAACTGGA AGAGTQGAAA GGCAGACTAC AAGTTACTGA GCACCTCCCT GAGAAAATTG 660 
AAAGTAGTTT ACAGGAAGAT GAACCTGAGA ATGATGCTAA GAAAATTGAA GCACTGCTAA 720
ACXTTTCCTAG AAACCCTTCA GTAATAGATA AACAAGACAA GGACTGAAAG TGCTCTGAAC 780 
TTGAAACTCA CTQGAGAGCT GAAGGGAGCT GCCATGTCCG ATGfiATGCCA ACAGACAGGC 840 

CACTCTTTGG TCAGCCTQCT GACAAATTTA AGTGCTGGTA CCTGTGGTGG CAGTGGCTTG 900 
CTCTTGTCrr TTTCTTTTCT TTTTAACTAA GAATGGQGCT GTTGTACTCT CACTTTACTT 960 
ATCCTTAAAT ITAAATACAT ACTTATGnT GTATTAATCT ATCAATATAT GCATACATGA 1020
ATATATCCAC CCACCTAGAT TTTAAGCAGT AAATAAAACA TTTCGCAAAA GATTAAAGTT 1080 
GAATTTTACA GTTAAAAAAA AAAAAAAAAA AAAAAAA 1117 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1313 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

GC3CAGA0GTr TTCTTATATT TTAAGTAAAT TTAAAGTGGC TATCAGAATA TTTATTCTTG 60 

TTTGAGACTA CCAACATAAC TAOGTGTTGA AGGTGCTTCA CAGAGAATAT ATTGCCTTTA 120 

ATCTTGAAATA ATTTTCACCA ATGTTGCTAA CTTTAATAAA GTATAAAATT TGTAGAATAT 180 

TCAGTTAAGT AGTTQGTAAC CCTTTTCTAT TTTAGTAAAA CTTAATGCAT GTTTACTTTT 240

TTTTGAAAGA TQCAGACAAT CTCTCTTGAAC ATGAATTQGG QGCTCTCAAT ATOCXTOCAT 300 

TACTACGAAA AGAAGAAAGA GCAAGTCTTC TTAGTAATCT TGGCCCATGT TGTAAGQCGT 360 

TGTGCTTCAG AOSGGATTCT GCAATTCGAA AGCAGCTTGT TAAAAATCAG AAGGGCACCA 420 

TAAAACAAGC TTACACGAGT GCTCCAATQG TAGACAATGA ATTACTTCGA TTGAGTCTTC 480 

GGfTTATTTAA GCGGAAGACT ACTTOCCATG CTCCAOGACA TGAAAAGACT GAAGATAATA 540

AACTTTCACA GTCCAGTATC CAACAGGAAC TGTGTGTC3TC TTAAGACCGA AGTTACAATA 600 

TOGTATTTTT GGTACTGTCT TCCTTCAGCA GTGCATATTC TTTTGCAAAG TTCrrrOGTr 660 

TGACAAGCAT TAGTGACAAA GQCAGAAAAG ATTTATCAGC CATOCTAAAA GAGTGAAGAA 720 

TTTTGATCTT TAGAGACACT AGTTTTC3GCC AACTTAAGAT TTTACGriTAA TTTTTACATA 780 

GTATTTGACA CTCATGCAAA ATAATGTGAA AACATCTAGA TTTAQTAGTr TATTCTGCGC 840

CTTTTGTTAA AACTGAAGAT TTTQGAAAAT GGTTGTCACT GCTCTTOCAG CCTATGAATA 900 

TTTTTGTGAA ATQGAACCAT GGATTTATGT CTGGATCATC CATACAGAAC CAACAATTTT 960 

ATTCAAAAAC AATGTGTTCA TCAAAGTAAT TGCTCACATT GTGCAGTACT ATGTTGTACA 1020 

GACCACGTGA AAGGGAATGC TQGTCTAGCT GGCGTGGTAT GTTTATAGGC GAATTTCAGC 1080 

AGAAGGAAGC CAAAATAGTT TTTTCCTTTT GAAAGTTTTT TAAAAATTAT TTCATGQGTC 1140

TTTTTTITAA TTAATATGTG TGCATTGTTA CAATGTATGT TGGGATGTCT TTTGACCCTA 1200 

AATGCTTTTT TTGTTATCAG AGATTGTGTA CTATTTTTAT TTTTAATAAA TGTATCTTCC 1260 

CTTTIHAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1313 



(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1654 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

ACAGGGACAG AATACTTTCT TTCCTTOCTT CAAGTACAAG AAGQCTTTCT CTACCATTTG 60 

CXTTCTACACT TTATTTTAAA AGCTATCCTT TTCTAGTAGT ATTTTATCAT GGCAATGGCA 120 

TGATGaCAAC AACAGTCTTT CATEACflGAC TGAAGGGAAG CATGTCCTTA CTTAAAATAG 180

TTCTGCTACT TTCCCTCCTA TTATAAGGAA ATTTTACAGA TTCTAAAAAT ACCTTAATTT 240 

TTCTTTGATT TTTATTTTAC CAAGTCACAA ATCTCTTTrT GATGTTTTGA GAATTGTTCT 300 

CATAGAATCA CAAATACTGA CATTTCATTA GATGATTATT TTCCTAGAAT CCCCAAAGAG 360 

CAGTGGCAGT CXJ^TGGCTTG CTTTGAAGCTA GAAATTTTCC TQCCCCTOGT GACCTGCTAA 420 

GCCPCCTQCT CQGAACXX3TG TGAGTGGGTG AGQAAGATGA GAGATGGTCA GATGGAAGAG 480

AGRAATACAT GAACTGCTCT GGCCTCTCTG &TTCTGTTCT TOGCCCAGAG TTTTTGAAAA 540 

GCAGCGGANA TNGACTGACT TCACATQCTC ABCTITCTCA QCCTTTTGTT TATTrTGTTG 600 

TOCTTAGATT TCCCTGnTGT AAAAGGGOCA AGAAAAGTAA CTC3VTCATCT CTAACACACC 660 

ATGGCAGCTT AGCCAQGTAG TCTTAGTGGT GGTGTTTAGG CATAAGATAT GCTGATCATC 720 

AGTCTCAGGC CACAGTTTCC TTCACTAATC GTCCAGCTTG AGTCrrCPGT TCTCTTCCTG 780

CCXATTTCCT TGAACCTCCT QCTCTAGCCT TC3GCGGAGGG AGAGTGCTAT TTGCTTTTGT 840 

TCTCCCTCTG TCTTAGGAAA AGCCATCTTT AATATAGTTC TTCACCACTG TTGGGGriTOr 900 

TTTGTGATTT TTTTTTTCTT CXI3AAGAACT CCTGGTPGTT ATTGGATTTT GTATTTTAAT 960 

ACAAATTATT GAATTTTATA AGCTTGTACA CAATATTTAA TTAGTGTGAA AGGAAACAAA 1020 

GAATQCAGGA AAAATAATTT AATATCAACC TCAGTTGACA AGGTGCTCAG ATTATTCAAT 1080

TCGGGATCCT CCTTTTGrTTA GGTTTTTGAG ACAACCCTAG ACCTAAACTG TGTCACAGAC 1140 

TTCTGAATGT TTAGQCAGTG CTAGTAATTT CX^XXSTAATG ATTCTCTTAT TftCTTTCCTA 1200 

TTCTTTATTC CTCTTTCTTC TGAAGATTAA TGAAGTTGAA AATTGAGGTG GATAAATACA 1260 

AAAAQGTAGT GTGATAGTAT AAGTATCTAA GTGCAGATGA AAGTGTGTTA TATACATCCA 1320 

TTCAAAATTA TQCAAGTTAG TAATTACTCA GGGTTAACTA AATTACTTTA ATATGCTGTT 1380

GAAYCTACTC TGTTCCTTGG CTAGAAAAAA TTATAAACAG GACTTTGTAG TTTGGGAAGC 1440 

CAAATTGATA ATATTCTATG TTCTAAAAGT TGGGCTATAC ATAAATTATT AAGAAATATG 1500 

GATTTTTATT CCCAGGATAT GGTOTTCATT TTATGATATT ACX3CAGGATG ATGTATTGAG 1560 

TAAAATCAGT TTTGTAAATA TGTAAATATG TCATAAATAA ACAATGCTTT GACTTATTTC 1620 

CAAAAAAAAA AAAAAATAAA NTTOGAGGGG GGGC 1654

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1171 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

GGCAAACTTT CCCCCAAN(3C TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTr 60

GGGTTGOGNC GGCGCCCTGG CCCGAAGAAG CGCAATTOGC GTTCCGCGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTC3GG CTACAAGGAC 180 

CTQGMCCTCA TCTTCCTGCG CCGACCTC3CG OQGGGTAAGG GGWAGTTTCA GACTCTGAAG 240 

GACGTCC3TQC TQGACTGCCT GTTQGACTTC TTACCCGAGG GQGTGAACAA AGAGAAGATC 300 

ACACCACTCA CGCTCAAGGA AGCTTATGTG CAGAAAATQG TTAAAGTGTG CAATGACTCT 360

GACCGATGGA GTCTTATATC CCTGTCAAAC AACAGTGGCA AAAATGTGGA ACTGAAATTT 420 

GTGGATTCCC TCCGGAQGCA GTTTGAATTC AGTGTAGATT CTITTCAAAT CAAATTAGAC 480 

TCTCTTCTGC TCTITTATGA ATGTTCAGAG AACCCAATGA CTGAGACATT TCAOCCCACA 540 

ATAATCGGGG AGAGCGTCTA TGGCGATTTC CAGGAAGCCT TTGATCACCT TTGTAACAAG 600 

ATCATTQCCA CCAOGAACCC AGAG6AAATC CGAGOOOGAG GCCTGCTTAA GTACTOCAAC 660

CTCTTGGTGA GGGGCTTTAG GCCOGCCTCT GATGAAATCA AGACCCTTCA AAGGTATATG 720 

TGTTCCAGGT TTTTCATCGA CTTCTCAGAC ATTGGAGAGC AGCAGAGAAA ACTQGAGTCC 780 

TATTPGCAGA ACCACTTTGT GGGAATTGGA AGACCGCAAG TATGaCTATC TCATGACCCT 840 

TCATGGAGTG GTAAATGAGA GCACAGTGTG CCTGATGGGA CATGAAAGAA GACAGACTTT 900 

AAACCTTATX:: ACCATGCTGG CTATCCGQGT GTTAGCTGAC CAAAATGTCA TTCCTAATOT 960

GGCTAATC3TC ACTTGCTATT ACCAGCCAGC CCCCTATGTA GCAGATGCCA ACTTTAGCAA 1020 

TTACTACAOT GCACAGGTTC AGCCAGTATT CACGTGCCAG CAACAGACCT ACTCCACTTG 1080 

GCTACCCTGC AATTAAGAAT CATTTAAAAA TGTCCTGTCG GGAAGCCATT TCAGACAAGA 1140 

CAGGAGAGAA AAAAAAAAAA AAAAAAAAAA A 1171 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 842 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GCTCTGCGCC GGAAC3TGCAT GflGCTGCCGA TGTGGTGCTT AGTGATTGCG (J1T 1 UJU 1 X,U 
CTCTCCOGTC TTTCCCGGGC TGQGTATTTG CCTOQCACCA TOQOOCCCAA GGGCAAflGTC 
GGCACGAGAG GGAAGAAGCA GATATTTGAA GAGAACAGAG AGACTCTGAA GTTCTAOCTC 
CGGATCATAC TGGOGGCCAA TQCCATOTAC TGCCTTGTGA aJl ' im;il, '' iT CTTTTACTCA 
TCTOCCTCAT TTTGGGCCTG GTTQQCCCTG GGCTTTAGTC TC3GCAGTGTA TGGQGCCAGC 
TACCACTCTA TGAGCTOGAT GGCACGAGCA GCGTTCTCTG AGGATGGGGC CCTGATGGAT 
GGrrOQCATGG AOCTCAACAT GGAGCAGGGC ATQGCAGAGC ACCTTAAGGA TGTGATCCTA 
CTGACAQCCA TCGTQCAGGT GCTCAGCTGC TTCTCTCTCT ATGTCTGGTC CTTCTGGCTT 
CTGQCTCCAG GCCGGGCCCT TTACCTCCTG TOGCrTGAATG TGCTGGGCCC CTQC?rTCACT 
GCAGACAGTG GCACCCCAGC ACCAGAGCAC AATGAGAAAC GGCAGCGCOG ACAGGAGCGG 
CQGCAQATGA AGCGGTTATA GOCATTGACA TTGTGGCCAC AGGCCACTOG CCCTQGG?PGG 
CTCTGTCAGG GTGCACAGCC CCTCATGCCT GGAGCAATGA GQC3TCTAGTC CAGGGQCCAA 
AAGCAGTCTG AOGTATTOGG TATACTTATA CTCTATAGQG TOGTTGAATA AATGGCTTAG 
AATGTGAAAA AAAAAAAAAA AAAAAACTCG AGGGGGGCCC GCTACCCAAT TTCNCCTANA 
AT 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1640 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:
GGCAOGAGGC GGOGGCAGCG GTGGCGQCGG OGCCCCCCGG OGGGAGCCGT TCCCTTTCCC 
GTCGGGGAGC GCGQGGYCGG GGCCCAGGGG ACCCCGGGCC ACGGAGAGOG GGAAGAGGAT 
GGATTGCCCG GCCCTCCCCC COOGATGGAA GAAGGAGGAA GTGATCOGAA AATCTGGGCT 
AAGTGCTGGC AAGAGCGATG TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 
GCCTCAGTTG GCAAGGTACC TGGGAAATAC TGTTGATCTC AGCAGTTTTG ACTTCAGAAC 
TGGAAAGftTG ATGCCTAGTA AATTACAGAA GAACAAACAG ASACTGCGAA ACGATCCTCT 360

CAATCAAAAT AAGGGTAAAC CAGACTTGAA ATACAACATT GCCAATEAGA CAAACAGCAT 420 

CAATTTTCAA ACAACXX3GTA ACCCAAAGTC ACAAATCATC CEACTAATAA AGTGAAATCA 480

GACCCACAAC GAAOGAATGA ACAGCXACGT CAGCTTTTCT GGGAGAASAG GCTACAAGGA 540 

CTTTAGTOCA TCAGATGTAA CAGAACAAAT TATAAAAACC ATCGAACTAC CCAAAGCHCT 600 

TCAAGGAGrrr GGTCCACTAG CAATGATGAG ACXXTTTTTAT CTGCTGTTQC CAGTQCTTTG 660 

CACACAAGCT CTGOCXCAAT CACAOSGCAA GTCTCCGCTC CTCTGGAAAA GAAOCTQCTG 720 

TTTGGCTTAA CACAaCTCAA CCCCTCTGCA AAGCTTTTAT TGTCACAGAT GAAGACTCAG 780

GAAACAGAAG AOCGAGTACA OCAAGTACGC AAGAAATTOG AAGAAGCACT GATGGCAGAC 840 

ATCTTX3TCGC GAGCTGCTGA TACAGAAGAG ATGGAXATTG AAATQGACAG TQGAGATCAA 900 

GCCTAAGAAT ATGATCAQGT AACTTTOGAC OGACTTTCCC CAAGAGAAAA TTCCTAGGAA 960 

ATTGAACAAA AATGrTTTCCA CTQQCTTTTG CXTGTAAGAA AAAAAATGTA CX:CGAGCACA 1020 

TAGAGCTTTT TAATAGCACT AACCAATQCX: TTrrTAGATG TATTTTTGAT GTATATATCT 1080

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTTGTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATOCCA GGOXXAATCT ACTQGAAATG 1200 

TAGCACTTAC GTAAAACATT TGrTTTCCXXX: ACAGTTTTAA TAAGAACAGA TCAQGAATTC 1260 

TAAATAAATT TCCCAGITAA AGATTATTGT GACITCACTG TATATAAACA TATTTTTATA 1320 

CTTTATTGAA AQQGGACACC TGTACATTCT TCCATCGTCA CTGTAAAGAC AAATAAA3GA 1380

TTATATTCXrA CAGAAAAAAA AAAAAAAAAW MWSTYGARRR GSRGCMCRSW AyHHARNifKX: 1440 

CCWMRTWRGS MKTCSnMTKA YTTACATTCA ACTCTGATCC OGGGGCCTTA GGTTTGACAT 1500 

GGGAGGTGGG AGQAAGATAG CXSCATATATT TGCAGTATGA ACTATTGCCT CTQGGACGnT 1560 

GTQAGGAATT GTGCTTTCAC CAGAATTTCT AAGGATTTCT GGCTTAAAIA TCACCTAGCC 1620 

TGfrGCTAATT TTTTTTCCCT 1640



(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 952 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:
TGAATTTAfaj AAACACTITG GAAAACTCAT AACCTCATCA GAAflCTGCCT TTAGCCACAC 60

TCCTGACCTT CTAGATCAGT AACAAAAAAA TGAAATAAGT TCTTGGAAAT TAAGCXJOTT 120

ATTTTAATIT GCTATTTTTT TCAATGTTCT AGGTATCTTT AAATTTGTTA ITGTGGAATC 180 

ATTTTCCTGC CAGATADCTT TATCAAAATT ATTQGCCTCA TGAGAGCTGA AGTAAGTCAG 240 

CTTTTPQGTG AACTTTAGTG GACTTCTGTG AGATTGTAGT TGTACTTTGT ATCTCTAAAT 300 

CTAAAGATAG TTTTTTAAAA CTCCCAAAfiA AAATCTQCTC TCCTTTCTGA TCTAAAAACT 360

CATCrnGGG GTAAAGAGrrr AAGTGTCCAA AGGTTGTCAC AGTTCATGAG GTCAGAGGGA 420 

GCTAGCCTGG CACCTGGACT CTGCCXATCC ACAGCTGACA GATTCCAACA GAAGTCTTATT 480 

TAAATTCTCX: AGTAGACAAT GCTGGGTAAG GGAGQGGGTA GQGCTGGGrrT ATTAAGATAC 540 

AGGCTC3CTGT ATTTTACATT GGTICTCXXSG GAAGGGGAGC CTGGAGAAAA CAAAGTCACT 600 

ATTCCCTTTT TTGAAACAGG AAAAAAAATT ATTrTTTGTT CAGTAAAAAT QGTAGAGAAT 660

TCCAATGrrcC CTAGCCACAA GGGACCAGTT CCACTGAGAA GTCAACAGrTG GGAACTCAAA 720 

ATTTCAGAAA CATTGGGGGA AGGGAAAATT GGCTTTCTCT TAATTGGCAG ATCrTTCCAGrr 780 

GGQGSGGQGG GGCTCnCTTT TTGTTGGGAT GTGTTATCTT GTAaXSTACGC ATATATQGAC 840 

CGGACTCTGC TGAGTTTATA AGGTITCCAAA AATATGGTAA AATCTrGGTT TTTGTTAATT 900 

TATCnCAATA AAAGCCCACT GGRACTCCAA AAAAAAAAAA AAAAAAAAGA NN 952



(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1256 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

GACGTCATAG GTAAACAGGC TCTGTATCCG TGGCAGCGGC CGTGGCAGGC TGGCTGGGTA 60

CCGGCTGTCG CTGACCCAGG AGAAGCTGCC TGTCTACATC AGCCTGGGCT GCAGCGOGCT 120 

GCOGCCGCGG GGCCGGCAGC TGAACTATGT GCTCTPCAGG GCGGGCACCG TGTTGCATTC 180 

ATCTTTGTAC CCCCAGCATC TAGCAGTGTT QGCATGTAGT AGGCACTCAA GAAATGTGTG 240 

TTGAATGAAC GATGCCTGTG ACAAGCAAGC GGACTTTATT CTTTCCTGAC CCTTGCTCCT 300 

ATGACACACC TCCTCCTGAC TGCCACTGTC ACTCCITCAG AGCAGAACTC CTCTAGGGAA 360

CCTGGATQGG AAACAGCCAT GGCCAAGGAC ATCCTGGGTG AAGCAGGGCT ACACTTTGAT 420 

GAACTGAACA AGCTGAGGGT GTrOGACCCA GAGGTTACCC AGCAGACCAT AGAGCTGAAG 480 

GAAGAGTOCA AAGftCTTTGT GGACAAAATT GGCCAGTTTC AGAAAATAGT TGGTGGTTTA 540

ATTGAGCTTG TTGATCAACT TGCAAAAGAA GCAGAAAATG AAAAGATGAA GGCCATOGC?r 600

GCTOGGAACT TOCTCAAATC TATAGCAAAG CAGAGAGAAG CTCAACAGCA OCAACTTCAA 660

OCCCTAATAG CAGAAAAGAA AATQCAGCTA GAAftSGTATC GGGTTGAATA TQAAGCTTTG 720

TCTAAAGTAG AAGCAGAACA AAATGAATTT ATTGACCAAT TTATTTTTCA GAAATGAACT 780

GAAAATTTCG CTTITATAGT AGGAAGOCAA AACAAAAAAA AGCCTCTCAA AACCAAAAAA 840

ACCTCTGTAG CAJITCCAGOG GCTTGACCAA TGACCTATGT CACAAGAGGT GGCGTGTAAG 900

GAATOCAOCC CCXTOAAGAC AGCACTACAA GTCTQGGGGA GCCAGTTTEA ACATCAGTGC 960

ACAGCTGCTG CTQGTOGCCC TC3CAGTGTAC GTTCTCACCT CITATGCTTA GTTGGAACTA 1020

AGCAGTnCT AAACTTTCAT CCnTTTTTT GTAAATTCAC AAAGCTTTGG AAGGAGAAOC 1080

AATAAATTTT TGrTTTTCAAA TGGCITGATG TACCTTTTTT CCTGTTGCTC TTGAAATATG 1140

TTTAACTCCT CATGAGAGAA CCCTGGATTC TCTATCCCCT ACTCCACAAA ACAAACCAGG 1200

CAGTGGTCAG CAGCTACCTT •mATTTGGAT CACACACGrTG AGTCAGACAG TACCAC 1256

(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

GGCCGTAGCA GCOGQGCTQG TOCTQCTGCG AGCCQGCGGC CCGGAGTOGG GCGGCQQCAT 60

GTACCTTCCA CATTGAGTAT TCAGAAAGAA GTGATCTGAA CTCTGACCAT TCTTTATGGA 120

TACATTAAGT CAAATATAAG AGTCTGACTA CTTGACACAC TGGCTOGAGC AAACATGAAC 180

GTTGGAGTTG CCCACAGTGA AGTGAATCCA AATACCOGTG TCATGAACAG OCGGQGTATG 240

TGGCTGACAT ATGCATTQGG AGTrOGCITG CTTCATATTG TCTTACTCAG CATTCCCTTC 300

TTCAGTGTTC CTGTTGCTTG GACTTTAACA AATATTATAC ATAATCTGGG GATGTACGTA 360

TTTTTGCATG CAGTGAAAGG AACACCTITC GAAACTOCTG ACCAGGGTAA AGCAAGGCTC 420

CTAACTCATT GGGAACAACT GGACTATQGA GTACACTTTA CATCTTCACG GAAGrTTTITC 480

ACAATTTCTC CAATAATTCT ATATTTTCTG GCAAGTTTCT ATAOGAAGTA TGATCCAACT 540

CACTTCATCC TAAACACAGC TTCTCTCCTG AGTGTACTAA TTCCCAAAAT GCCACAACTA 600

CATQGTGTTC GGATCTTTGG AATTAATAAG TATTGAAATG TTTTGAAACT GAAAAAAAAT 660

TTTACAGCTA CTCAATTTCT TATAAGGAAG GACrPOGnTAG TAAACTGCAC WmtTi y iU 720

ATAATCTGAA ATGAGAAGTA TrrACATTGG AGGQCCAATG GCTGGTCCTT CAftGrTGCTGrT 780 

TTTGAAGTGC AGATTTOCAT TAAflTGATGC CTCTGTTTAA TACACCTGGT ACATTTCTGA 840 

AGAGGGGCTT TATAAGCflGG CTQGGCAGGC CCAGCTTATA AGTTAAAGQG CATCACAGTC 900 

AOOGrrGTAGT AGAXAAATTC AAQGAAATAA GAGATTTGrrA AGAAACTAGG ACCAGCTTAA 960

CTTATAATGA ATGQGCATTG TGTTAAGAAA AGAACATTTC CAGTCATTCA GCTGrPGGTTA 1020 

TTTAAAGCAG ACTTACATGT AAACCGGAAT OCTCTCTATA CAAGTTTATT AAAGATTATT 1080 

TTTATTACCX? TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAANA 1140 

GAN 1143 



(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1782 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
CAGGCCCCGG CCCCCCACCC ACGTCTGCGT TGCTGCOCCG CCTGQGCCRG GCCCCAAAGG 60 
CAAGGACAAA GCAGCTGTCA GGGAACCTCC GCCGGAGTCG AATTTADGTG CAGCTGCCGG 120
CAACCACAGG TTCCAAGATG GTTTOCGGGG GCTTCQCGTG TTCCAAGAAC TGCCTGTOCG 180 
CCCTCAACCT GCTTTACACC TTGGTTAGTC TGCTGCTAAT TQGAATTGCT GOGTGGGQCA 240 

TTGGCTTCQG GCTGATTTCC ACJPCTCCGAG TGGTCGGOGT GGTCATTGCA GTGGGCATCT 300 
TCnCTTCCT GATTQCTTTA GTQGGTCTGA TTGGAGCTGT AAAACATCAT CAGGTOrTGC 360 
TATTYTrrTA TATGATTATT CTGnTACTTG TATTTAOTOr TCAGTTTTCT GTATCTTGOG 420
CTTGTTTAGC CCTGAACCAG GAGCAACAGG GTCAGCTTCT GGAGGTTGGT TQGAACAATA 480 
CGGCAACTGC TCGAAATGAC ATCCAGAGAA ATCTAAACTG CTGTGGGTTC OGAAGTGTTA 540 

ACCCAAATGA CACCrGTCTG GCTAQCTGTG TTAAAACrTGA CCACTCGTGC TCGCCATCrTG 600 
CTCCAATCAT AGQAGAATAT GCTGGAGAGG TTTTGAGATT TGTTQGTGGC ATTQGCCTGrT 660 
TCrrCAGTTT TACAGAGATC CTQGGTGITT GGCTGACCTA CAGATACAGG AACCAGAAAG 720
ACCCOOGCGC RAATCCTAGT GCATTCCTTT GAT6AGAAAA CAAGGAAGAT TTCCTTrCGT 780 
ATTATGATCT TGTTCACTTT CTGTAATrTT CTCnTAAGCT CCATTTGCCA CTTTAAGGAA 840 

GGAAACACTA TCTQGAAAAG TACCTTATTG ATAGTGGAAT TATATATTTT TACTCTATGT
TTCTCTACAT GTTTTTTTCT TTCOGTTGCT GAAAAATATT TGAAACTTGT GGTCTCTGAA 

GCTOGGTOGC ACCTQGGAAT TTACTCTATT CATTC3TCGGG CACTGTCCAC TGTOGCCTTr

CTTAGCATTT TTACCTQCAG AAAAACTTTG TATGOTACCA ClViVriWi ' TATATGGTGA 

ATCTGAACCT ACATCTCACT GGTATAAITA TATGTAGCAC TGTGCTGTGT AGATAGTTCC 

TACTGGAAAA ASAGrPGGRAA TTTATTAAAA TCAGAAflGTA TGAGATCCTG TTATGTTAAG 

GGAAATOCAA ATTCCCAATT TTTTTTGGTC TTTTTAGGAA AGATGTGTTG TGGTAAAAAG 

TGTTAGTATA AAAATGATAA TTWACTKGTA GTCTTTTATG ATNACACCAA TGTATTCTAG 

AAATAGTTAT GYCYTAGGAA ATTOrGGTTT AATTTTTGAC mTACAGGT AftGTGCAAflG 

GAGAAGriGGT TTCATGAAAT GTTCTAATGrT ATAATAACAT TTACCTTCAG OCTCCATCAG 

AATQGAACGA GnTTTGAGTA APCAGGAAGT ATATCTATAT GATCTTGATA TTGrTTTTATA 

ATAATTTGAA GTCTAAAAGA CTGCATTTTT AAACAAGTTA GTATTAATGC GTTGGCOCAC 

GTAGCAAAAA GATATITGAT TATCTTAAAA ATTCTTAAAT ACCGmTCA TGAAAGTTCT 

CAGTATTOTA ACAQCAACTT GTYAAACCTA AGCATATTTG AATATGATCT CCCATAATTT 

GAAATTGAAA TOGrPATTGTC TOGCTCTGTA TATTCTC5TTA AAAAATTAAA GGACAGAAAC 

CTTTCTTTGT GTTATCCATGT TTGAATTAAA AGAAAGTAAT GG 

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 610 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:
GTTQGCTQCA GATTTGTGGT GCGTTCTGAG CCGTCTGTCC TGCGCCAAGA TGCTTCAAAG 
TATTATTAAA AACATATQGA TCCCCATGAA GCCCTACTAC ACCAAAGTTT ACCAGGAGAT 
TTQGATAGGA ATQGGGCTGA TGGGCTTCAT CGTTTATAAA ATCCGGGCTG CTGATAAAAG 
AAGTAAGGCT TTGAAAGCTT CAGCX3CCTGC TCCTGGTrCAT CACAACCAGA TTTACTTQGA 
GTACATGTGA AAGAAAACGT CAGTCTGCCP GrTAAATTTCA GCAAGCCGTG TTAGATQGGG 
AGCGTGGAAC GTCACTGTAC ACTTGTATAA CTACCGTTTA CTTCATGGCA TGAATAAATG 
GATCICTGAG ATQCACTGCT ACCTGGTACT GCTTTCAGTC TGTrCCCCCT CAGCCCTCCG 
GCGTGTCAGG CATACTCTGA GTAGATAATT TGTCATGCAG OGCATQCAAT CAGAATCTCA 



CTGAQCCACC CATCATTGTG AAATAATTAC CTCAGTTGTA CAGGACTTGG TGATCAGGAT 540
CCAGGCACTC ACTTGTATTC TACTGCTCAA TAAACGTTTA TTAAACTTGA AAAAAAAAAA 600
AAAAAAAAAA 610



(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 526 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

GGTACGCCTG CAGGTACCGG TCCGGAATTC CGGCTTCGCCC ACGCGTCNGG CCACGCGTCC 60 

ACCCACGOGT CXXSSCCACGC GTCGGAGCCX; AGCCGGACTG GTCAGGATGA TCAOSGAOGT 120 

GCAGCTCGCC ATCTTCXXXIA ACATGCTOGG CgrGTOGCTC ITCl'lt^CTlXj TCGTTCTCTA 180 

TCACTACC3TG GCCGTCAACA ATCCCAAGAA GCAC3C3AATGA AAGTOGCGCT OTCTCCGCCC 240 

CflGGGTTCCA GGACATAGTC TGAGGCAAGA TGGAGGGTAT GAGGGGCCTT CACACTTCAC 300 

TTCATCOCTT CTACCCATCA CAACATACAA AGCAACTACA CCTGGATTTT TCCAAACAAC 360 

TTTTATTTCC TCAGAGTCTT CCTTAATCCT ATGQAACAAG AACXTTGCCAC TGAATAGGGC 420 

CCAiSTATAGG GGCTTGCTTT TCTACTCCCT CXXXTCAATA TAAAAATATA GACTXTTTAA 480 

AAAAAAAAAA AAAAANTTOG NGGGGGGSCC GGTACCCATC CCCCTA 526 

(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2081 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

TGTACCGGTC CGGAAATTCC OGt3C3TCGACC CACGTOGTCS GGGGAACATG QCGGCTKCGG 60 

AGCCGGCGCT CCITOCGCTC CCCAACAGCG QCGCCGGGGG CGCGGGGGCG CCGTCGGGCA 120 

CAGTCCCGGT GCrCTTCTGT TTCTCAGTCT TCGCGCGACC CTCGTCGGrTG CCACACGGQG 180 

CGGGCTACGA GCTGCTCATC CAGAAGTTCC TCAGCCTGTA CGC30GA0CAG ATCGACATGC 240 

ACCGCAAATT CGTGGrrQCAG CTGTTOGCOG AGGACTOGGG CCAGrTAOGTG GACTPQOCCA 300

AQGGCTTCGC GGTRAGOGAG CGCTGCAAGG TCCGCCTCGT GCCX3YTOCAG ATCCAGCTCA 360 

CTACOCTGGG AAATCTTACA OCTTCAAGCA CiUl t yiTl T r CTGCTGrTGAT ATOCAGGAAA 420 

GGTTCAGACC AGCCATCAAG TATTTTGOGG ATATTMTAG CGTQGGACAG AGATTGTTGC 480 

AAGQGGCECG GATTTTAGGA ATTCCTGrTTA TTGTAACAGA ACAATACXCT AAAGGTCTTG 540 

OGAGCAOOGT TCAAGAAATT GATTTAACAG GTGTAAAACT OGTACTTCXA AAGACCAAGTr 600 

TTTCAATGGT ATTACCAGAA GTAGAAGOQG CATTAGCAGA GATTCCOGGA GTCAGGAGTG 660 

TTGTATTATT TGCaGTAGAA ACTCATGTCT GCATCCAACA AACTGCCCTG GAGCTAGTTG 720 

GCCGAGGAGT CXS^GGmCAC ATTGTTGCTG ATQCCACCTC ATCAAGAAGC ATOATGGACA 780 

GGATGTTTGC CCTCGAGCGT CTCGCTCRAR CCGGGATCAT AGTOACCACG AGTX3AGGCTG 840 

TTCTGCTTCA GCTGGTAGCT GATAAGGACC ATCCAAAATT CAAGGAAATT CAfiAATCTAA 900 

TTAAGGCXaG TGGTCCAGAG TCGGGTCTGC TTTCCAAAGT ATAGGACATT TGAAGAACTG 960 

GTATGCTACT CACTGGTGAA GGACAGTCAG GTGAAGGACT GTAAGCCCAC ACAAGCTCTT 1020 

CTTATCTCTA CTAGAATTAA AATGriTAAGT CAAAAACGGC ' IWriTi'A ' llr CGCCTCCTAG 1080 

TGAAACTTAA CCAGCTAGAC CATTTGAGTA CCAGCATTTA GTTACAAACG TCAAAGGCTT 1140 

CCGGTOCTGC TTACCTTCCT TrrTTGTTAA TGTGCrTTTA TTTATTAAAA AAAATTACAA 1200 

TGAAGATGCX: TGTTTTGTCT CTACTOTGTA CTCTGATCGrT ATCTTTCCAA AGTGCAGACT 1260 

CTTGTGAAGrr TTTCTTAAAT TGTTCACTTT AAAGAAAATG AC3GTACCAAC AATGATTTGG 1320 

CTTTTAIATT ACTGTAAGAT GTTATAATGT TAAaXSTOGAT GTAGrTGCTTT TACTTTACAG 1380 

AITCATTGGA ATAAGATTAT TGCATATGAA TTTACCCACA GGACTCTGAA TCATGTTACC 1440 

CACTCXXrrC ACAATGTTGT CCACTTAGTG AGTTGCATTG ATCTATCCGT ACCAAATGAT 1500 

GTTGAATAAT TACATATCTT TCPTGACTAT ACTGATTTCT TATTTTGGTC ACTATTACEA 1560 

AATCTCPGTT AATATTCTCT CTTTTAACTG AAAAGGGATG GGATAGAAGG GrrTTGCAATG 1620 

CXATATTATT GGTQGAGGGC TGrPTTTAACA TCTTTGAAGT ATGGCTTGCT GAATATCTTT 1680 

AOCAACATCT TGAATATATA TTCTAGTGTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG 1740 

GTOGAATATC ATTTTAAAAT GTTCATGrTTC TGrrTCTATAT TTTCTTCACC TACTCTCCAA 1800 

ATATTCrTAAT GCAAAAAGTC TCAGTAAITGA TTTOGTAGTA TTAATTTTGT GGTCATICTT 1860 

TCTCTTCGAT AAATTTATTT TCATTAAATA CTTRTTAGAG GGmTTGAAA TGTmTCAA 1920 

ATATGTTGAAA TOTGAAACTG CTGTCTTTTA TATTAAAGTA ATTAAAGAAA ATGTATTGTG 1980 

ATTGAAATTA TTTTCaKXrrC CACAAQATGG CTCTATGAGT ATTCTTCCAG GGATTCTAAT 2040 

AJTrTATTTAA GGmiATAAAA TCTTGACATT TATAATCTTT C 2081

(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

CCCCGGOGGA GCTQGACCCG CGGrrGGQCTA GGGQCAGGGC CGGAGCOSCG GCGGCGGAGC 60

TGTGGATCCT TCftTGATGAG AGATTTQGGG ACACTTCTCT CTCCTGrrGTG TAGTTGATAG 120 

TTTGGTGGTG AAGAGATOGC TGACAGTGTC AAAACCTTTC TCCAGGACCT TGCCAGAGGA 180 

ATCAAAGACT CCATCTQGGG TATTTGTACC ATCTCAAAGC TAGATC3CTCG AATCCAGCAA 240 

AAGAGAGAOG AOCAOCCTTCG AAGAAOGGCA AGTAGTGTCT TOGCACAGAG AAGAOCOCAG 300 

AGTATAGAGC GGAAGCAAGA GAGTGAGCCA OGTATTGrTTA CTAGAATTTT CCAGTGTTGT 360

GCTTGGAATG GTGGAGTCTT CTQGTTCACr CTCCTCTTGT TTTATOGACT ATTTATTCCT 420 

GTGCTTCAGT CGGTAACAGC CCGAATTATC GGTGACCCAT CACTACATGG AGATGTTTGG 480 

TCGTOGCTOG AATTCTTCCT CACGTCAATT TTCAGrrOCTC TTTQGGTGCT CCCCTTGTTT 540 

GTOCTTAGCA AAGTCSGTGAA TGCCATTTGG TTTCAGGATA TAGCTOACCT GGCATTTGAG 600 

GTATCAGGGA GGAAGCCTCA CCCATTCCCT AGTGTCAGCA AAATAATTGC TGACATQCTC 660

TTCAACCTTT TGCTQCAQGC TCTTTTCCTC ATTCAQGCaWV TGrPTTOPGAG TCTCTTTCCC 720 

ATCCATCTTG TCGGTCAGCT GCTTAGTCTC CTGCATATGr CCCTTCTCTA CTCACTGTAC 780 

TQCTTTGAAT ATCGTTGGnT CAATAAAGGA AITCAAATGC ACCAGCGCTTT GTrCTAACATA 840 

GAAAGGAATT GGCCTTACTA CTrTGGGTTT GGTTPGCCCT TOGCTTTTCT CACAGCAATG 900 

CAGTCCTCAT ATATTATCAG TGGCTGCCTT TTCTCTATCC TCTTTCCTTT ATTCATTATC 960

AGCGCCAATG AAGCAAAGAC CCCTQGCAAA GCRTATCTCT TCCAGTTQOG CCTCTTCTCC 1020 

TTGGTCSGrPCT TCTTAAGCAA CAGACTCTTC CACAAGACAG TCTACCTGCA GTCGQCCCTG 1080 

AGCAGCTCTA CTTCTQCAGA GAAGrTTCCCT TCACCGCATC CGTOGCCTGC CAAACTGAAG 1140 

GCTACTGCAG GTCACTGAGT TGCCTGCCAT OCAAAGQGGA TGQGCGGGAT TGGAAGAAGC 1200 

TGTPGGCAGCT CTTTTCCCTG TTCACCTCCC GCCTC3CCAGG GAAGGCAGGA CCCGCTCTGC 1260

CAAGGGCCCT CTGCGTATTC CCTTCTCTCT GAGGAATTGA AATTTTTGrC TCTGCrTGCAC 1320 

CTAAGGCAGA ATGnTCCCTG ACACCAGTGT GrrGGATTTTT AACATCACCG TGAGTCTGAA 1380 

AGGACCACAG GmTTCTOC AGCTATTTrC TAGCATTTGC CAGTCCCTOr GCCTGGACTG 1440

ATTGGAACAC TTTGTTTTTC TOCCTGTGCC ATTTACOCTT CCAjCXTTTCC ATCCTGOCTT 1500 

CTACCACCCT TGGATOAATO GftlYl'lVrAA •rrCTAGCTGT TCTATTTTGT GAATTTGTTA 1560

ATrTTGTTGT TTTTCTOrGA AACACATACA TTGGATATOG GAGGrTAAAQG AGTGTCCCfiG 1620 

TTGCTCCTGG TCACTOXTT TATAGCCATT ACTGTCTTGT TTCTTGTAAC TCAGGTTftGG 1680 

TTrrGGTCTC TCTTCCrcCA CTGCAAAAAA AAAAAAA 1717 



(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

CCACGCGTCC GC3TCACTATG TAGTGGAGGG QCAGACACCC TCCOXAAAT TCTOGAAGGTr 60 

TCTTAGTCTC GACTAGGGCA GTAGCCCCAG GACTCCTAGT CGCCGGCTTC AGGTCACTQC 120 

CQQCTGAACG GAGCTGOCGT CGCCATGTTT GCXTTQCTTCG TC3GOGGGGAG OCTQGTGCAA 180

ACAGCTGCAC AGCAAGTGGC AGAGGATAAA TTTGnTTTTG ACTTACCTGA TTATGAAAGT 240 

ATCAACCATG r H jflULrri' X T TATQCTGGGA ACAATCCCAT TTCCTGAGQG AATQGGAGGA 300 

TCTGTCTACT TTTCTTATCC TGATTCAAAT GGAATGCCAG TATGGCAACT CCTAGGATTT 360 

GTCACGAATG OGAAOCCAAG TGCCATCTTC AAAATTTCAG GTrCTTAAATC TGGAGAAGGA 420 

AQCCAACATC CTTTTGGAGC CATGAATATT GTCCGAACTC CATCTGTPGC TCAGATTC3GA 480

ATTTCAGrrOG AATTATTAGA CAGTATGGCT CAGCAGACTC CTGTAGCTAA TGCTGCTGTA 540 

TCCTCAGTTG ACTCATTCAC TCAGTTCACA CAAAAGATGT TGGACAATTT CTACAATTTT 600 

GCTTCATCAT TTtSCTGTCTC TCAGC3CCCAG ATCACAOCAA GCCCATCTGA AATGrTTCATT 660 

CCGGCAAATG TGCrTTCTGAA ATGGTATGAA AACTTTCAAA GACGACTAGC ACAGAACCCT 720 

OTNTTTTGCSW AAACATAATT TGAATAAAAT AATTTTTAAT GGAITOTGNA AAAAAAAAAA 780

AAAAAAAAAA AAAAAAAAAA AAAA 804 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 431 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

GGCACAGCCC AGGGCCTIUA AGCCAGCTGG CXXTTGGAGAG GGGCTOCTGrr GCCAGCTTQG 
GGAGGGrrCTG GGATQGGC3CT GCCCCTGATG GCCCTGMCT GGAGrTACCTT GCCAGCKPCT 
GCTGGQGTGA ACTTTATTTT AGCCCTTCCC TTCITOCTCT TATQGAfiGAA CAGAGGflGGG 
GTOGGCAGGT CAGTGArorC AQCAGTQGAG TGATTCCCAG CACAGCGQCT TCTGQGAAGA 
GGGCATGGAG GCATTTCTrT CAGGGAAATG GTCCA3NATT TCAGCCAGAA GGCATTGCAT 
TAAGITAAGT CCNGGACTITT TCfTGOCCCAG CTCTOICTTA TTAAGGGCCC TTGGOGAflGA 
CTTCAAGGAG GGQGCAAAAN GACCTTTAAG TrTTTAQGTT TAACACAQGG AACCCNCAAA 
GGGTTATTTT G 

(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3752 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
NGGCACGAGG AGAGTCACCT GGACTCAGAA CTAGAGATAT CCAATGACOC AGACAAAATT 
AAACTTCAGC TTTCTAAGCA TAAGGAGTTT CAGAAGACTC TTGGTGGCAA GCAGCCTGTG 
TATGATACCA CAATTAGAAC TGGCAGAGCA CTGAAAGAAA AGACTTTGCT TCCOGAAGAT 
ASrCAGAAAC TTGACAATTT CCTAGGAGAA GTCAGAGACA AATQGGAiaC TGTTTGTGGC 
AAGrPCTGTGG AGCGCXaGCA CAAGTTGGAG GAAGCCCTGC TCTTTTCGGG TCAGTTCATG 
GATGCTTTQC AGGCATTGGT TGACTGGrTTA TACAAGGTGG AGCCACAGCT GGCTGAGGAC 
CAGCCCGTCC ACOQGGGACC TTGACCTCGT CATGAACCTC ATQGATGCAC ACAAGGTTTT 
CCAGAAGGAA CTGGNGAAAG CGAACAQGAA CCCTTCAGGT CCTGAAGCGG TCAGGCCGAG 
AGCTGATTGA GAATAGTCGA GATGACACCA CTTGGGTAAA AGGACAGCTC CAGGAACTQA 
GCACTCGCTG GGACACTGTC TGTAAACTCT CTGHTCCAA ACAAAGCCGG CTTGAGCAGG 
CCTTAAAACA AGCGGAAGTG ITrOGAGACA CAGTCCACAT GCTGTTGGAG TGGCTTTCTG 

AAGCAGAGCA AACGCTTCGC TTTCGGQGAG CACTTCCTGG ATGACACAGA GGCCCTGCAG

TCTCTCATTG ACACCCATAA GGAATTCATG AAGAAAGTTAG AAGAAAAGCG AGTGGACGTT
AACTCAGCftG TAGCCATQGG ASAAGTCATC CTGGCTGrTCT GCCACCOOGA TTQCftTCACA 840

ACCATCAAAC ACTQGATCAC CATCATCXXSA GCraXTITOG AGGAGGTCCT GftCATQGGCT 900 

AAGCAGCRCC AGCAGCXJTCT TGAAACGGCC TTC3TCAGAAC TGGTOQCT A A TOCTGAGCTC 960 

CTGGAAGAAC TTCTGGCATG GATCCAGTGG GCTGAGACXA CrCTCATTCA GCGGGA3t:ayS 1020 

GAGCXMICC CX3CAGAACAT TGACCGAffTT AAAGCCCTTA TCOCTGAGCA TCAGACATTr 1080

ATGGft3GAGA TGACTCGCAA ACAGCCTGAC OrGGACCXSOG TCACCAAGAC ATACAAAAGG 1140 

AAAAACATA6 AOCCTACTCA OGCGCCTTTC ATAGAGAAAT CCCOCAOCGG AOGCAOGAAA 1200 

TCCCTAAGTC AGCCAACCCC TCCTCCCATG OCAATCCTTT CACAGTCTGA AGCAAAAAAC 1260 

CCAOGGATCA ACCAGCITTC TGCCXX3CTGG CAGCAGGrTGT GCXTGrTTAGC ACTGGAGCGG 1320 

CAAAGGAAAC TGAATOATGC CTPGGATCGG CTGGAGGAGT TGAAAGAATT TGCCAACTTT 1380

GACTTTGATG TCTOGAGGAA AAAGTATATG CX3TTQGATGA ATCACAAAAA GTCTCGflGTG 1440 

ATGGATTTCT TCCGGCGCAT TGATAAGGAC CAGGA3GGGA AGATAACAOG TCAGGAGfTTT 1500 

ATCGATGGCA TTTTAGCATC CAAGrTTCCCC ACCACCAAGT TAGAGATGAC TGCTGTOGCT 1560 

GACATTTTCG ACCGAGATGG GGATGGTTAC ATTGATTATT ATGAATTTGT GGCTOCTCTT 1620 

CATCOCAACA AGGATGOGTA TCGACXAACA ACCGATQCAG ATAAAATCGA AGATCAGGTT 1680

ACAAGACAAG TQGCTCAGTG CAAATGTOCA AAAAGGTTTC AGGTGGAGCA GATCQGAGAG 1740 

AATAAATACC GGTrCTTCCT CGGCAATCAG TTTGGGGA3T CTCAGCAGrT GCGGCrGGTC 1800 

CGTATTCTGC GCAACCGTGA IXATl'lUJajr TOGTOGAGGA TQGATGGCCT TGGATGAATT 1860 

TTTAGrFGAAA AATGATCCCT GCCGAOCACG AGGTAGAACT AACATTGAAC TTAGAGAGAA 1920 

ATTCATCCTA CCAGAQGGAG CATCCCAGGG AATGACCCCC TTCCGCTCAC GGGGTOGAAG 1980

GTCCAAACCA TCITOXGGG CAGCITCCCC TACTCGTTCC AGCTCCAGrTG CTAGTCAGAG 2040 

TAACCACAGC TGTACATCCA TQCCATCTTC TCCAGCCACC CCAGCCAGTG GAAOCAAGCT 2100 

TATCCXATCA TCAGGTAGCA AGTTGAAACG ACCAACACCA ACTTTTCATT CTACTCQGAC 2160 

ATCCCTTQCT GGrPGATACCA GCAATNAGTT CTTCCCCGGC CTCCACAGGT QCCAAAACTA 2220 

ATCGGGCAGA CCCTAAAAAG TCTGCCAGTC GCCCTGQGAG TCGGGCTGGG AGTCGAGCCG 2280

GGAGTPCGAGC CAGCAGCCGG CGAGGAAGTG ACGCTTCTGA CTTTGACCTC TTAGAGACGC 2340 

ATTGCTTGrrr CCGACACTTC AGAAAGCAGC GCTGCAGGGG GCXAAGGCAA CTCCAGGAGA 2400 

GQGCTAAACA AACCTTCCAA AATOCCAACC ATGTCTAAGA AGACCACCAC TGCCTCCCCC 2460 

AGGACTCCAG GTCCXIAAGCG ATAACACTGT CTAAGCACXX: CCAAGCCACT ATCCACTTTG 2520 

AATCCTGCTC CATACATTGG GTOEATATTT ATTCTGAACG GGAGAAGTTA TATTOITAAA 2580
AGTGTAAAAG AATAATTGTG TTATGAAGCT GCCTTATTTT aTl ' lTJTri ' i ' l ' GTAAGTTACT
ATTTTCAICT GAATATTTAT GEAGATAAAA TTTGCCTCCT QGTAACCCTG TAATQGATOG 
GGCXXaVGAAA TGAAATATTT GAGAAAAACA AGrTGAAAAQG TCAftGATACA AATGTGTATT 
AAAAAAAAAA AflGCCTATTA ATAGGGrTTTC TCXXSCGGrTGC AGGGTTOTAA ACCTQCTTTA 
TCTTTTAGGA TTATTOCTAA A'TGCATCTTC TTTATAAACT TGACTTGCTA TCTCfiGCAAG 
ATAAATEATA TTAAAAAAAT AWGAATCCTG CAGTGTTTAA QGAACTCTTT TTTTOEAAAT 
CACGGACACC TCAATTAGCA AGAACTGAGG QGAQGGCTTT TTCCATTGrT TAATGnTTTG 
TGATTTTTAG CTAAAGAGAG GGAACCTCAT CTAAGTAACA TTTQCACATG ATACAGCAAA 
AGGAGTrCAT TGCAATACTG TCTTTOGATA TTGTTTCAGT ACTGGGTGTT TAAAGGACAA 
ATAGCTGCTA GAAITCAGOG GTAAATGTAA GICTTCfiGAA AAOGTrCAGAA CATTTGGGGT 
TTTAAACTGA TTTGTTQCTC CCTATCCAGC CTAGACACCA GTAACTCTTG TGTTCACCAG 
GACCCAGACC CTTGGCAAGG GATADGCTCG TTGGTtSACAT TGTGAATTTC AGATTTCTTT 
TATCCACTTT TTTTQCTATT TATTTAAATG GTCGATCAAC TTCCCACAAA CTGAGGAATG 
AATTCCftCGA GCXTOTTCTG AAAATCTGGA OGTAAGACAA ACAOGTGCTC GTCCTTTAAT 
GGAGTTCACC AGCACACTTG TTAACCflGTC CTGTITOCTT TOGTCTTTrT TTGTGCX3TAA 
TAAACTTCAAC TGACCAAGTG ACCATGAAAA GGOGCTGTCT GGGGCTCCTG TTTTITAGCT 

GCTGTrcrrc agctccgacc ATcrrrGCTrGT ctgaitatct caaxtggttt taattgasgc 

AGAAACIGAA GCTCTACCAA TGAACTGTTT AGAAACAAGA CACACTTTTG TATTAAAATT 
GCTTGCAGTA ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGAOG GGGGCCXX3GT 
ACCCAATTCG CCGTATATGA TOGTAAACAA TC 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
TGACCCTCTG CCTGCCGQQC TCAGTC3CTGG ACGCTTTCTG TTTTGrPOGCA GTCGGTCCTC 
GGTAACACCA QOGGCCTC?rG GTCCACCACT CCATTCAGCA GCTCCATTTG GTCCAGCAAC 
CTTAGCAGCG CCTTCCCTTC ACCACPCCAG CAAACAOOCT GGCAAGCATC GGCCTCATGG
GCACAGAAAA CTCOCCIGCT CCTCACGCTC CCTCCACCTC CAGTrCCAGCT GACGftCTTQG 240

GACAGACCTA CAAOCCGTOG COGATATQGA GCCOCAOGAT TQGAAGAAGA AGCTOGGftCX: 300 

CTTGGrrCTAA TTCGCACTTT CCTCACGAGA ATTAAATTAA GCAAAAAACA AACAAACAIEA 360

GTGGGCCCTC GTCTAGATCA TGATOTCSCCA GTTrCTGftGA CATCTTTTTA AGGCTCTTAC 420 

TOCAGCTCCC CTCOCXZACCC TCCTCTTCTT TOCAAAACAG ACCX31AGCAG GGCAGGCTCA 480 

GftCCACTCGC TTCrrTCAGA TCTTTCTTGC AATTATQATA ACATGAGATT 'lU;'iVrAUii; 540 

CTTTTAGAGA AAAGTCTGGA CTCAGCCACA AACTCTAATA AGACXTTCTAC ATCTGAGAAC 600 

CTTTCCCGTT ACPGCGTTTT CACCACCTGT CTTOXCATG CTTTATTTAT CTGTATGAAC 660

ACAGATTTGA CATTACAGCT AAQGAAATAA TTTGAGTTGA TTCAGEAAATC CTQGCATGTC 720 

ACAATTTTGT TAAATTACCA AGTrTOGTIT TTAATAATTT CTCAATATTA TGCGCCAAGA 780 

TCTAATTTTA AAACK7EATG AGC5ACTTTGT GCTGAAAATA GAGrTATTTTT TTAAAGIAAG 840 

GCrcrrCTTGG TTTAAAACXA GATTACAGAA ATGTAAGTCA ACTTAAGAAC RGTGAATGAA 900 

TGTAAAAACA TTCAGTYGAG AOCATATGCA TTTTCTGTGC TCrrPGTACT TGAGCTAOCT 960

AACATTTGTA TACCTGAACT TATTTTAAAG ATGAACTGAA ATGCACATAG CCAAGrrCTTTG 1020 

AGATACAAGA TTGAAOXSTGT ATTTCTTAAA AATACAACTT TGTGTTGTAC TTTGAAATAA 1080 

ATGATCCTTT TTPCAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAQGGGGG GCXXXSGTACC 1140 

CAAT 1144



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1830 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
GCATGCAGAG GAGCACCCTG AGCGTCTYCC TGGAGCAGGC GGCCATSTTG GCACGGAGCC 60 
ACGGGOTGCT GCCCAAGTQC ATCATGCAGG CCACX3GACAT CATGCGGAAC AGGGCCCAAG 120
GGTGGAGATr CTGGCCAAAA AOCTGCGAGT CAAGGACCAG ATGCCCCAQG GTGCTCCGOG 180 
CCTCTACCGC CTCTGCCAGC CGCCGGTQGA TQGGGACCTC TGAACACCCA AATGCCOCAC 240 

GCTGOGOCGC GGCCTCTGGA GCTGGGAITT GGGAGGACAC AGCAGGCAGC GCTOGCCTTC 300 
TCCAGQGATG GCCCAANGCT TCCGCARCCG CCCGTTCCGG GACCTGCCCA GCGTCCTCCC 360 
TGOCTCCTTC CQGGACAAGC CTQGCCACCC TCGCTGTGAT GACGAQCTQG CTGATTGGCC 420

CrOGGCOSGC CCATTCTTCA CACX5CCTOCC ASAAGCTGGA QGGGTOCTOG AGACCCftTAG 480

AGCTGATQGG AGCAGCTG&T GCCTGGOCTT CGGCTCCTGC GTCCOCAGAA CCCAAGGGAA 540

OGrrCATQGAG GCX»CA3X3GG COCMXXXXX: TCCCTCGGGA TGGCTCOGCT GCACTTTTGA 600 

AACCCXX3GTT TCCTTCAACG TCCACA3TCC AQGTGACXIAC ACGTCTCTCC TCCTCCTC A T 660 

CTTAGCTTCC AGGTTCACCC TAACCCTGTA CTAACCTGCT TOGTOGACTT GGAAAAGACT 720

TOGCTCTGTC GGGAAAQGAG AGAOSGGGCC TCCMCACGC CTGTTAOCAG AGGATCOCOS 780 

AGAGCCACAC CAGCTCTGGA CATCACOGCC OCTQQAACTG GGGCCAOCAG CCCTQGGCAC 840 

GAGATTTGCT CTGACTTTAT TTATATGGCA TGAAATCTCT GGrTTTATTTT GGGATTTTTT 900 

GTTGTTQGTC TTGTCAAAGT TTOn-TTTTC TAAAGTTGTG TGATTATATA TTTGACATTT 960 

TACATTTCAA AGAAAGGTAT GTTGTCTAAC AGGGGACCAA CASAAGGTAG TATTGACAAC 1020

TGTTCCTGCT TCTACTAAAA AAAAAAGAGC ACAAAAGAAA AACTAAAITA TTGAAAAATT 1080 

AAAAAATC7PC ATTGrTTTCCT GTTTOTTAAT ATTAGQGTTG TAAGGTGTOG TTTTGAGGTEA 1140 

TCGACTGrrGA TTCrTTOXC CACCCTCCAT TCTCTAGCGG TTGGCCXXrTG TTAGAACTOG 1200 

cnrrrcTTTGA GTGACTGGCT ACAAGGQCCT GAGAGGTGGC CAGCCAGGGT TQGAGCTQGA 1260 

GGQGATQGAG CCCCACCTGA GGTGCCGrPGT CACACGOGTT AGAGGGTCAC TQGGAAACAC 1320

CXSGGCGGTCG CTTCTGTQAT TTATTTTCTT GATGGTAACT TCTCAGAGCA GGGCRATTGG 1380 

GACATCACCA GCCAGAGCAC AGGAAGCCAC CCTGOCTGCT GGGGAGGAGG GACCCACACA 1440 

AGCCCCCrCG GCAGTTTGTC CCXXXAGCTT OQGTATGCCT TCAGGGAAAG GTCACAGCTG 1500 

GGGAGGAAGC GGGGGGAOSC CTGTCACCCC TGGCAGGTGG TGAGTTCAGG TGGGGGCTOC 1560 

CTGCnCOCCC CAGQCCTGGG AGCTTGAAGC CCTCCCX3GCA TCTOGCATCC GAGCCTCCCG 1620

CXXmXAGGG TQCQCTTCCC TCTCTTGCCG CAGCATACAC GAGGGCAGQC AGTOGCCrTG 1680 

TCACTGTATC rPGCAIPCAGA GACAAAGGAG GACCCGCTTT AGCCCTGCTG CGGGAAATGG 1740 

GGGATGGCCC AGGGCCAGCG CATTGTGCAC TOGTrTACTT TAAAATGTAC AGATTCTTCT 1800 

CXTTTAAATTC TTGATAGATT TTTTATTATT 1830 



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1864 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

GGCCGCCXX3G ATOO^GACCC CAQCCPOGGC CCCAGACACA CGGGCTCTQG TGGCAGACTT 60 

TCTAGGTTAT AAGCTGAGGC AGAAGGGrTTA TGTCTGrPQGA GCTGGCCXXX5 GGGAGQGCCC 120 

AGCAGCTGAC CCGCTGCACC AAGOCATOCG GGCAGC3GQGA GAOGAGTPOG AGACCCGCTT 180 

COSGOGCACC TTCTCTGATC TQGCGGCTCA GCTOCATGTG ACCOCAGGCT CAGCCCAACA 240 

ACGCTTCACC CAGGTCTCCG ATGAACTPTT TCAflGGGGGC CCXIAACTGGG GCCGCCITCT 300 

AGCCTTCTTT GTCTTTQGGG CrOGACTCTG TGCTCAGAGT GTCAAC3UVGG AGArTGGAACC 360 

ACTGGTOGGA CAAGTGCAGG AGrGGATOGT GGCCTAOCTG GAGACGCGGC TGGCTGACTG 420

GATCCACAGC ACTGGQQQCT GGTTATOXA GATCACTGAA QCTGAGATGG CTGATGAAGT 480 

AATTTGCAGT GAAATTTTAA GCGACTGTGA CTCTGCTGCA AGTTCCCCAG ATCTTGAGGA 540 

GCTGGAAOCT ATCAAAOCTC GAGTrCAOOGA GATOGAOGAA GAAGCTGAGA AOCTAAAOGA 600 

GCTACAGAAC GAGGTAGAGA AGCAGATCAA TATGAGTCCA CCTCCAGGCA ATGCTGGCCX: 660 

GGTGAITCATG TCCATTGAGG AGAAGATQGA QGCTGATGCC CGTTCCATCT ATGrXTOGCAA 720

TGPQGACTAT GGTQCAACAG CAGAAGAGCT GGAAGCTCAC TTICATGGCT GrrGGnTCAGT 780 

CAACOGTGTT ACXMACTCT GTGACAAATT TAGTTGGOCAT CCCAAAGGGT TTGCX3TATAT 840 

AGAGTTCrCA GACAAAGAGT CAGTGAGGAC TTCCTTGGCC TTAGATGAGT CXTTATTTAG 900 

AGGAAGGCAA ATCAAGGrTGA TCCCAAAACG AACXZAACAGA CCAQGCATCA GCACAACAGA 960 

CCQGGGTTTT CCACGPiGCCC GCTACCGCGC COGGACCACC AACTACAACA GCTOXGCTC 1020

TCGA3TCTAC AGTGGTTrTA ACAGCAGGCC CCGGQGTCGC GTCTACAGGG GCCGGGCTAG 1080 

AGCGACATCA TQGTATTCCC CTTACTAAAA AAAGTGrTGTA TTAGGAGGAG AGAGAGGAAA 1140 

AAAAGAGGAA AGAAGGAAAA AAAAAAGAAT TAAAAAAAAA AAAAAAAAAA ACAGAAGmG 1200 

MCCTTGATGG AAAAAAAATA TTnTTAAAA AAAAGATATA CTGrGGAAGG GGGGAGAATC 1260 

CXMTAACTAA CTGCTGAGGA GGGACCTGCT TTGGGGAGTA GGGGAAGGCC CAGGGARTGG 1320

QGCAGGGGGC TGCTTATTCA CTCTGGQGAT TCGCCATGGA CACGTCTCAA CTGCGCAACT 1380 

GCTTGCXX:AT GnTPCCCTGC CCCACCCCAC CXXTCTTCTC CGGCTCCXTTG CCCCTCCAGA 1440 

TTGCXTCGTC ATCTATnTG TTTCCTTITG ' XUmU ' mT TCTGTTrTGA GriCTCTTTCT 1500 

TTQCAGGTTT CTGTAGCXXX; AAGATCTCCG TTCCGCTCCC AGCGGCTCCA GTOTAAATTC 1560 

CCCTTCCOCC TQGGGAAATG CACTACCTTG TTTTGQGGGG TTTAGGGGT6 TTTTTGTTIT 1620

TCAGTTGrrTT TOrTTTTTTG ' XTn ' rrmi ' r nTCCTTTGC CTTTTTTCCC TTTTATTTGG 1680 

AGGGAATGGG AGGAAGTGGG AACAGQGAGG TGGGAGGTGG Al'lTlVmA TTTTTTTAGC 1740 

TCATTTCCAG GGGrrOGGAAT OTTTTTTrAA TATGriCTCAT GAATAAAGTT GTTTTTCAAA 1800

AKAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1860 

AAAA 1864

(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2041 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

GGCACGAGCG CGCGCXAGGG CCCTGGACCC GCGOGGCTCC CGGGGATCSGT GAGCAAGGCG 60

CTGCTC30GCC TOGTGTCTGC OCTCAACCGC AGGAGGATGA AGCTGCTC3CT GGGCATOSCC 120 

TTGCTQGCCT A0GT0C5CCTC TGTTTGGGGC AACTTCGTTA ATATGAGGTC TATCCAGGAA 180 

AATGGTGAAC TAAAAATTGA AAGCAAGATT GAAGAGATOG TTGAACCACT AAGAGAGAAA 240 

ATCAGAGATT TAGAAAAAAG CTTTACCCAG AAATACOCAC CAGTAAAGTT TTTATCAGAA 300 

AAGGATOGGA AAAGAATTTT GATAACAGGA GGCGCAQGGT TCGTGGGCTC CCATCTAACT 360

GACAAACTCA TQATGGACGG CCACGAGGTG ACCGTGCTGG ACAATTTCTT CACGGGCAOG 420 

AAGAGAAAOG TQGAGCACTG GATOtSGACAT GAGAACTTCG AGTTCATTAA CCACGACGTG 480 

TGGAGCCCCT CTACATCGAG GTTGACCAGA TATACCATCT GGCATCTCCA GCCTCCCXTPC 540 

CAAACTACAT GTATAATCCT ATCAAGACAT TAAAGACCAA TAOGATTQGG ACATTAAACA 600 

TGTTQGGGCT QGCAAAACGA GTCGGTQCCC CSTCTGCTCCT GQCCTCCACA TCGGAGCTGT 660

ATQGAGATCC TGAAGTCCAC CCTCAAAGTG AGGATTACTG GC3GCCACGTG AATCCAATAG 720 

GACCTOQQGC CTCXTACGAT GAAGGCAAAC GTGTTGCAGA GACCATGTOC TATGCCTACA 780 

TGAAGCAGQA AGGCGTGGAA GTGOGAGTtSG CCAGAATCTT CAACACCTTT GGGCCACGCA 840 

TGCACATGAA CGATGGGCGA GTAGTCAGCA ACTTCATCCT GCAGGCGCTC CAGGGGGAGC 900 

CACTCAOGCTT ATAOGGATCC GGC3TCTCAGA CAAGQGCGTT CCAGTACGTC AGCGATCTAG 960 

TGAATQGCCT CGTGGCTCTC ATGAACAGCA ACGTCAGCAG CCCGGTCAAC CTGGGGAACC 1020 

CAGAAGAACA CACAATCCTA GAATITQCTC ACJTTAATTAA AAAOCTTGTT GGTAGCGGAA 1080 
GTGAAATTCA GTTTCTCTCC QAAGCCCAGG ATGACCCACA GAAAAGAAAA CCAGACATCA 1140

AAAAAGCAAA QCTGATGCTG GQGTGQGAGC CCGTGGTCCC GCTGGAGGAA GGTTTAAACA 1200 

AAGCAATTCA CTACTTCCGT AAAGAACTCG AGTACCAGGC AAATAATCAG TACATCCCCA 1260

MCCPMCCC TGCCAGAATA AAGAAAGGAC QGACrOCXX:A CAGCTGAACT OCTCACTTTT 1320

AGGACACAAG ACTACCATTC TACACTTGAT GGGATGTATT r i'muc mT ' n ' i ' iUi '' ii;it: 1380 

GnTTAAAGAA AGACTTTAAC AGGTGTCATG AAGAACAAAC TQGAATTTCA TTCTGAAGCT 1440 

TOCTTTAATG AAATGGATGT GCCTAAAAGC TCCCCTCAAA AAACTQCAGA riTlUXTit; 1500 

CACTTTTTGA ATCTCTCTTT TTATCTAAAA TAGCGTAGAT GCATCTCTGC CTAmTCAA 1560

GTTTTTTTAT CTTQCTGrTGA GAGCATATGT TGrrGACICTC GTTGACAGTT TTATTTACTC 1620 

GrrTTCTTTGT GAACXTTGAAA AGGAACATTA AGOGGGACAA AAAATGCCGA TTTTATrTAT 1680 

AAAAGTOGGT ACTTAATAAA TGAGrrCGTTA TACTATGCAT AAAGAAAAAT OCTAGCACTA 1740 

TTGTCAGGTC GTGGTGCGCC GGCATTGATT TTAGGGCAGA TAAAAGAATT CTOTGTCAGA 1800 

GCTTTATOTT TCTCTTTTAA TTCAGAGITT TTCCAAGGTC TACTTTTGAG TTGCAAACTT 1860

GACTTTGAAA TATTCCTGnT GGTCATQATC AAGGATATTT GAAATCACTA CTGTGTTTTG 1920 

CTGCGTATCT GQQQCXX3GGG CAGGTTOGGG GGCACAAAGT TAACATATTC TTGGrTTAACX: 1980 

25 

ATGGTTAAAT ATQCTATTTT AATAAAATAT TGAAACTCAC CAAAAAAAAA AAAAAAAAAA 2040 

A 2041 

30 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2012 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:
TACCAAGCTG CAAGAATCTA CTATATCATG GCAGAAGAAG TAGAGTGGGA CTATTGCCCT 60

45 GACCGGAGCT GGGAACGGGA ATGGCACAAC CAGTCTGAGA AGGACAGTTA TGGrTPACATT 120 
TTCCTGAGCA ACAAGGATGG GCTCCTGGGT TCCAGATACA AGAAAGCTGT A3TCAGGGAA 180 
TACACTGATG GTACATTCAG GNTCCCTCGG CCAAGGACTG GACCAGAAGA ACACTTGGGA 240 

50 

ATCTTGQGTC CACTTATCAA AGGTCAAGTT GGrPGATATCC TGACTGTQGT ATTCAAGAAT 300 
AATGCCAGCC GCCCCTACTC TGrTQCATGCT CATGGAGTGC TAGAATCTAC TACTGTCTQG 360 
55 CCACTQQCTG CTGAGCCTGG TGAGGTGGTC ACTTATCAGT GGAACATCCC AGAGAGGTCT 420 
GGCCCTGGGC CAATGACTCT GCTTGTGTTT CCTGGATCTA TTATrCTGCA GTGGATCCCA 480 
TCAAGiGACAT GTATAGTOGC CTGGTGGGGC CdTGGCTAT CTGCCAAAAG GGCATCCTGG 540 

NAOCCCCATG GAOGACGGAN TGACA3t3GAT CGGGAATTTG CATTGnTCTT CTTGATTTTT 600
GATGAAA^ITA AGTCTTOGrTA TTTOGAOGAA AATGTTGGCAA CCCATGGC7IC CXAGGATOCA 660
GGCAGTATTA ACCTACAGGA TGAAACmC TTOGAGAGCA ATAAAATGCA TGCAATCAAT 720
GGGAAACTCT ATGCCAACCT TAGGGGTCTT ACCMXHACC AAGGAGAAOG AGTGGCCTOG 780
TACATGCTQG CCATOOGCCA AGATGTGGAT CTACACACCA TCCACTITCA TGCAGAGAQC 840
TTCCTCTATC GGAATGGCGA GAACTAOCGG GCAGATGrPQG TGGATCTGTT CCCAGGGACT 900
TTTGAGGrrTG TGGflGATGGT GGCCAGCAAC CXrrGGGACAT GGCTGATQCA CTGCCATOTG 960
ACTGACCMG ICCATCCTGG CATQGAGftCC CTCTTCACTG TTTTTTCTOG AACAGAACAC 1020
TTAAGCOCTC TCACCGTCAT CACCAAAGAG ACTGAAAAAG CAGTOCCCCC CAGAGACATT 1080
GAAGAAGGCA ATGrPGAAGAT GCTGGGCATG CAGATCCCCA TAAAGAATGT TGAGATC3CTG 1140
GCCTCTGTTT TGGTTGCCAT TAGTGTCACC CTTCTGCTCG TTGTTCTGGC TCTTOGTOGA 1200
CTQGTTTQGT ACCAACATCG ACAGAGAAAG CTAOGACGCA ATAGGAGGTC CATCXTOGAT 1260
GACAGCTTCA AGCTTCTGTC TTTCAAACAG TAACATCTGG AGCCTOGAGA TATCCTCAGG 1320
AACSCftCATCT GTAGrTGCACT CCCAGCAGGC CATGGACTAG TCACTAACCC CACACTCAAA 1380
GGGGCATQGG TGCTTOGAGAA OCAGAAOGAG CAATCAAGCT TATCTGGATA TTTCTTTCTT 1440
TATTTATTTT ACATC3GAAAT AATATGATTT CACTTTTTCT TTAGrTTTCTT TGCTCTACGT 1500
GGGCACCTGG CACTAAGGGA GTACCTTATT ATCCTACATC GCAAATTTCA ACAGCTACAT 1560
TATATTTCCT TCTGACACTT QGAAGGTA1T GAAATTTCTA GAAATGTATC CTTCTCACAA 1620
AGTTAGAGACC AAGAGAAAAA CTCATTGATT GGGTTTCTAC TTCTTTCAAG GACTCAGGAA 1680
ATTTCACTTT GAACTGAGGC CAAGTGAGCT CrrTAAGATAA CCCACACTTA JACTAAAGGC 1740
TAAGAATATA GGCTTGATCG GAAATTGAAG GTAGGCTGAG TATTGGGAAT OCAAATTGAA 1800
TrTTGATTCT CXTTQGCAGT GAACTACTTT GAAGAAGIGG TCAATGGGnT GTTGCTGCCA 1860
TGAGCATGTA CAACCTCTGG AGCTAGAAGC TCCTCAGGAA AGCCAGTTCT CXZAAGTTCTr 1920
AACCTGrPOGC ACTGAAAGGA ATGTTGAGTT ACCTCTTCAT GnTTAGACA GCAAACCCTA 1980
TCCATTAAAG TACTTOTTAG AACACTGAAA AA 2012



(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

GAGCAGTATT TTAACCAACT TGTATTACAG ATGTTACAGT TCATGTTAGG AAGTCAGAAA 60 

5 

AGACTTTGrrr TGTCTTTGfTT CTQCTGATCT GAGTCATGTT lTgIX3C3GglC TTCCATQGCA 120 

CATTTACCTG TTGCTCCGTC CAGATGriTGA OGGCCAGrPCT AGGCTCACAC ATCCTAIXOG 180 

10 AGGACAAGCC TGTTCTOCAT TTCTTCACTC TCCCCTCCCC ATATAGCAAC TCTCCCAGGT 240 

TTAGATTACC GTTTTCGACG ACAGATTAAC CAAAAATGCC CXIACACAGGT TTTATTACTG 300 

TTATATACTA TACTTTOAAC ACTTACAGACC CTAAATTTTA TTATTTGnTG CTOXCCAAT 360 

15 

CTGATACCAA ATGTTTAAAG TTGTTTGAAA TCCAAACATG GTAGTGTTCA TGQGTAAATA 420 

TTTTCTAGGC TATGTAAEAG TTAGCAGCCC ATAGCATAGA AGTAATCAAG TAGCATCTGA 480 

20 GACTGTTQGA GGCACTAGGG CCTCTCTGGG CCTAACAGCX: TCACTTCCCC AGCCTCACCT 540 

TQCTGrPCCTrC TGACACTGCX: AlCflGGGCTG TTAGTGGCAC CTGTATGAGG CCAAGTGTGC 600 

GTCCAGQGGA ACAGCACAGG TTAATGCGTC TCCCTAGAAC TCATGAAGTC AGTTTAATTC 660 

25 

AIGCATGAAC ATGAGTTCAT TTTATGTTTT ATATAGCTrT CTTAGACATA CCAAAOCATC 720 

ATTCATAAAT CAGATAAATT ATTCAGTTTT TGTGTTTAGA AAGCTAAGTA TGnCTAGCTG 780 

30 GAAACAAAAA TGAGOGTGTT TTCPCTCCTG TTAATCTAGA GTGTGCAGTT ACACATGrTGT 840 

GGATAATTTC ATGTTCCAGG GGCGCTTGGC ATCTCCCATG GACTGATTCC CAGGAAGAAA 900 

AGCXCAAAGG GAAACCCACG ATTCCTTTCG AGrTAGATGTG GGAAAGAGCC CATTQOAGGA 960 

35 

TATGAGCrrcC TCTGAAATTC ACTTGTX3TGT CTrGGCTCCTT GTTAGCAGTC ATGTTGACAT 1020 

GGT&TTAGGA GGCTCCC?CAT CCACCCTTTA CATGATGTAG GGACCAGTGT CTTGrrGftGAT 1080 

40 TAACCTTGGG ACACAGTOGG ITAGCCTGGA GAAAATGftGA GGCCCTGCCT GGACCCAGQG 1140 

AGAGGAGCCA GTQACACAGG CAGAGCGCTTG CAGCXTTCCT TCCCTTCCAT TTGGAGGAGG 1200 

TGGTGCCAGG AGCCTGCCCG CTTACCTCTG CTGAAGCATA AGTGGACTTT GCTTTTGGQG 1260 

45 

CTTTATCTCTG ATACATGCTG GAGCXXTGCC TCTCCACTGC TAGATGGAAC CTGGAATCTC 1320 

TCATCTACCT CTTAGTCTGT CAGTrTCTAC CTICTGAGAAG CAAGCTTGTG QGCCAGrPGTC 1380 

50 CTTGTACATG CTGTAGCACT TAAAAAATAA TTCCAGGGTT CCCTGGAAAA CCAGTCCCAG 1440 

QGTTCCTATG ATCTGTAGnT TCTACXTTOGA TTATAACTGG ITTTGGGTAC CTGAATTTTG 1500 

ATPGGTTAGC CTTAATTATA GrPCTGGC3GTG ATCATGTAGA ATCTTTTCTG GTGAACAGAT 1560 

55 

CATAAAGrrrC TATCAAGGAG TTCTATCAAG GCATCCATGT CAGTOGTGCT ATCCTGGTTA 1620 

CAACTTGAGA TTTTTGAAAT AAAAAATTTG TCATAAAAAA AAAAAAAAA 1669 

60 
(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1565 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

CACrrrTGCT ATATAACCTA AGTGATAACC CTCrnrAGT TACCTGCCAA ACTCTGGNCT 60 

15 TQGTTTATAT TGCAGTTAAC ACAGTPACAA AGCTGTAATG G TCiX.Tm T TTCCTTTGTA 120 

ACGGAATGTC TAAATCAAAG TATATACA1T GTGTOGTGTT CCTGTTTCTG GAGTTTCATG 180 

AGGATTTACA CATGGCATTC AGTGTTCTGT ATAGATCTGC CTACCTTTCT GAATTCATCT 240 

20 

GTTAACCCCT CTTCCTTTGA GAGAGCACCG GCGATQGrTGG TTAACTCCTT GTGTTTTCTC 300 

TCTCTCCTAC TGfflTATrCT TGAATTAAGC ACAGACTCGT CAGCTCGC5TT GCTTTATCAT 360 

25 GAATAATGTG TGTGACCTTG CAGTrCTTCC ACAGrTTCAGC AAACAAGTGC TAGCTTCACT 420 

GACCAAAAAT TAAGGAAGGA AAACACAGTT TTTAAAACGA TCCATCTTTT AACAGCOGAA 480 

ACCGATGTGT CTATGGTGCT GCACCTTQCT GTTGTACTTC TGAAATCAGA CGTGTGTGAA 540 

30 

CGATCATTTC TGACTTAACC GTGAGATGCT CACGAGTACC CTTCCTGTTG TTTICTTAGC 600 

ATTGAAATCG AGACTATTTA TTTGGAATAT ATACAACAGT GTTTTTOCAC TGTATTTCAT 660 

35 TTQCAAAAGT TGAGAACTGC TTTCTCTACC rTTTGCAAAA TAATTGATAT TCCATA1TGG 720 

ATTCTCAAAG ACTTOGATAT GGTGAACCTA TTAAACCTAG AAATTGTATT CATCCTTTCA 780 

TGACTGTGGC CTGAGTTCCC CAGCCCCTCT CCTCCTTTTT TTTAGATGAG ATTTAGCACA 840 

40 

CTCTCAGTTA TTTAAACATG CAACATTTCT TGAGTATGTA TGTTGAGGCC ATCTGAGCTC 900 

ATAGCTGATT CACTAACCAG TTTCATOCTG TGTCA1TCAC ACTCACTACT TAATACTGCC 960 

45 ATGGTGAAAA TGTQGAGGAA AAATGTATCC ATGrTGTGTCT GGGAAGCATA TACACTTGTA 1020 

CATTTTTTAA TACTCTGATT CTGTAACATT TCTGAGTTTT (jTi ' l ' ltrrn ' r ACACNAAAAA 1080 

AAAAAAAAGT GATAAAGCAA TCAGAAGACC AAGAGGTTTA CTATTGATGC TTAGGGTCGT 1140 

50 

CTGACCTTQG CTGGCCAATA GACCTACACG GCCAAATTAA TTTACGAGAG TAATAATTTT 1200 

TCAAAAGCCA ATTTTTTTTC TCTATTTTCT GTATGAAACT GCCAATATCA TGAATAGAAA 1260 

55 GGGAGAACCA TAAAGGAGAA AGAACGTGAT GTTCTGnTAT GTTCATGTAA ACCTAAAGAA 1320 

ACAGTGTGGA GGCAGGOGCG ATCAGCOGAA CTCTAGGGAC TTGGTGTrGC TTGGAAGGCA 1380 

TCCATACCTG CATTTTGCAT TCTTCGTATG TAATCATATT GCCAAAGACA AACTATTTCA 1440 

60 



TCATTTATTG TAAAIEAACAC TTrTCCOCAG ACCTACCATA AAGTrTCTGT GATCTATTCT 1500
CTTCCAGrrrG CAATAAAAAT TACTGAGrrrG CATCAATTGA AGAAAAAAAA AAAAAAAAAA 1560
CTOGA 1565



(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2007 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

20 TCTAAAAGCC CCCTTATACC CCACTTTGTG CAGCAAftGAT CCCCGTCCAG GTCACAGCCT 60 
GATTTGTQGC CAGQCTQGAC AAATTCCTGA GGCACAACTT QQCTTCAGTT CAGATTTCAA 120 
GCTGTCTTGG TGTTGGGACC AGCAGAAGGC AAAOGTCCAG CCAACACACA QGACICTAAG 180 
AGGACTCTGA GCTACGTGCC CTGTGAAGAC OCCCAGGCTT TGTCATAGGA GGTaTTTCAG 240 
CTTCCCCAAA GTCAGAGGTC ATrTCATTTG GGGAAGACTG AATATTCACA CCTAAGTCGT 300

30 GAGCATATCC TGAGrTTTAC TTOOTTATGG CTTGCCCTCC AAGriTCTCTC TCTCATACAC 360 
ACACACACCC TTOCTCCAGA ATCACCAGAC ACCTCCAOXSG CTCCAGCTAT QGGAACAGCT 420 
GCATTOGGGC TGCCTTTCTG TTTGGCrTAG GAACITCTGT GdTCITCTG GCTCCACTCG 480 
CGAGQCflGCT CGGAGGTGTC GACTCCGATT QGGCTQCAGG CAGCTCTOGG ACGGCACAGG 540 
GOQGGCQCTC TQATCAGCTC GTGTAAAACA CACCGTCTTC TTCGCCICCT QGCAGTTCTT 600
TCTGCGAATA GTCCTCTCCC TGGCCAGTTG AATQGGQGAA GCTGCTOGCA CAGGAAGGAG 660

AGGCGATCCC GGCTGAGGCT TAGGAAA3TG CTGGAGCCGG CTCCAAQCAG ATAATTCACT 720
GQQGflfc*,l-i-r TCAGAGTCAA ACAITCATTCT GCCTGTKTTG GGQGCCAGGT GTOTCACACA 780

AGCATCTCAA AGTCAAAAGC CATCTGGGGC TQCTGCTTCT CTTTCTCAQG CTCTGGGGAA 840
AGGAATCPCC CTCTCCTCTC ACTTGATTCC AAGTCTGGTT GAATTGTCTG GAGCACTGGG 900
ACITTTTTTC TCTTTTCCTT GATGGACCAA CAGTGCAAAT GCAATCTCGC CATTTAACTT 960
ICAGGTCGAT TTCCTrTCCT GATCAGACAT CTTTOTGCCC CCTTTAGGAA QGAAAAGAAT 1020



ACACCTACGA TGTGCCAGGC ACTOTGTTAG GCGCTTTTAT ATAGATCCTC GTrAGGATQA 1080 

GACTAAGGGA TQAGGACATC TCTTTATAAA AGGCCCCTAA GTAATQGATA AACAGAAACA 1140 

CTTAGAGGTG AGAAGGTCTG TCTTCAAGAT CCAAGGTAAG ATTOCCTTCA GTCTCATGTT 1200 

60 TGnrCTCAAG GACITATCCC CTACAATATT CTCCCACTCC ATACTTCTCC TTCTACCCCA 1260 
CCATGTGCTC CCCTrGCACTC CTCAGATGC7P CAGAQGGGTA ACCCAAGTOC TTAGAGAATT 1320

TGGGGACCAA TAGAATATCT GATGTGTCAA TTTTCTTTAA AAAACTTAAG GAGTCTTTGC 1380 

TACCTTCTGC TTCrrTGAGTT GTTTTC3GCAT TCATATTAAA AGCXAGCATC TCACTATTTA 1440 

TTGACAGGrrr GGGCTOPGTG TOPGCGCATG TGTGrTATACA TTTCCAGGCG TGCCTGTGTC 1500 

CTGTAGCTTT TTAAAAGGAA AOCXZAGTCAT CCCACTATGA ATCTQQCATC TTCTTATQCTr 1560 

TCTAGrrGTrT TGGCTATACA TCAACCAAQG GGTTTAATTT ATtXAATOCT TGAOGACATC 1620 

TTCAQGAOGG GCTGGATCAA PnrmXSUiPiG GGTTATGGGA AAGGQAOC3GG GAGAAGAAAT 1680 

TGACATTTAT TTTATTATTT ATTTTAAATG TTrACATCTT CTTTATOrTG TATCAAGCCT 1740 

GAATAGAAAC TQATAGCAIT AAAATACTCC GTTCCTCTCT CTCTTCTCGC ITCCTTTTTT 1800 

TTTTTTTTTA AATTTAGGAT AACACATTTT TGTTTCTAAA GTGATTTGTG ATTTGTOCTG 1860 

TATAAACTGT ATAAAAGGTT CTGTTTTTAA AGGTCGATTT TCATTCCTCT GGGGACAGrPG 1920 

GTCGOCAAGA CATCTACATT GTAA3AGAAC ACAGltSGAAG ATCCTGTCCT GATTCTCAAA 1980 

AATTATTTTC TCTGTATGAIT TAAAACT 2007 



30 

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1291 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

40 

CTTTTAACCC TCCCCCTTCA CACACATACA TATCAGGnTG TTTTCTAGTT AAAAACCCAA 60 

GTAGCTCAGA TTCTACITTA ATGTCAGrTGC AGATTTGCAT TGAATCATGC CATTATGTTT 120 

45 TTTCTCATTT TTATC3CTGTT GGGTCTTAGT TTTTAAATTG ATATAAAGAA CTCAGCAATG 180 

CTTTTATTTT CTACTCATAC TTAGGGTTTA GGAAACACTA CCACTAGTrTA TCATTTAATC 240 

AACTTCAATG GTCTACTGAA ACAAAAATOG TAACTTTTCA TTAGTGGATT ATTTAGAGTT 300 

50 

ATAGTAGTTG TTTCCAGAAA ACACTTOCTC ACAATTGTAC TTCCCAATCA AATCATGTGA 360 

TCATACAGTT ATTCCCATGA AAGQCAGAAT GTTTGTrTCA AAATTAATCT AGTTrPCTGT 420 

55 ACATTTAAAT TTGAGAAGGT GACAACTGGC TCTTTTCCAG TCTTCCTTCA TGTCAGTTTT 480 

CTGATAGACC ACTATTGGCA AACAGTATCT GTCAACTACC AAATGTGTAA AATTTTCTGT 540 

ATTTCACTTT GTCTTATTTG TAAATAGTGA ACTAAAACTT TTGGCAGATC AGCAACATTT 600 




GCTGAGOCTG TTTTTTAAGC TAATCTrGTAT TCTTACTAAT GTTCCTATCA AGAATOGAIT 660 

TGTAATATAT GCTGTCTATT TCTAATGnTC ACATTCATAT TTTGAGGrTPC TATCTTATTT 720 

TAATAGAGAA CAGACTTCTC AAAAAATCTT CAGAAGCAGC TTArrATTGA AATAIXXSAAA 780 

TATTCAAATA AACCCX3GTGG GrFTAGATTAC TCATCTGrTOC ACCAAGTOGG ACATTTOCAT 840 

GGACTGGGGG CTTAAAGGAC TTAGAAGAGA OCTGTAAGTA AATCCTGAAA A3X3AGCCAAT 900 

CXXXACTTGA ATGGTTACTG GAGTEAAACCC ACCTTTACCA CCCCAAHTAC AGCACCCX3AG 960 

GCOGATAAAC CAACTTGGCT CPGGTTCATT T lTCiTriVT TCATTTGTGA TGCTCAGATT 1020 

15 CAAAAOCTGT GITCTACACT GTTACAGGCT TCTCTTTTC3T TTGATTAAAG ATTTTAGnCC 1080 

TACTTTTGTA TGSACACATT AGAATATTCA GAGACCAAAA TAGAAGAATT TGCTGTTAGA 1140 

TATTTTTCAG AAGTCAGCAG AOTPGTGGCA AATCATTTAT TTGCCTTTTT AAAAATTCAT 1200 

20 

TIAAGCAGTT CAGAGAGTAG ACTACTCAGA AAATTATTTC AOGTAATTGT CTAAGAGGTC 1260 

AATATTTTTT AATGCATATT GAATCAAATA A 1291 

25 

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1906 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

GGCACGAGGA CCTACTTTTC TAACAGACCA TGGTTGTGTC CAAGGTAAAA' CCACAGTGAT 60 

40 ATTTTTQGAT GCTTTGTCTG CAATCTTGAC TTOTTTTTGC AGTATCATTA TTCAGACTTC 120 

AAATTGTGAA TCTTTTAAAC ATCTTGATAA T TlVl ' l XJ rm ASAQCTGTTC ATTCTAAAAT 180 

GTAATGAAAT TCAGTCTAGT TCTGCTGATA AAGATCATCA GTTTTGAAAG GTTACTGATT 240 

45 

TTCCTCTTCC CTCTTAGrrTT TTTACCCAAT ATATGGAGAA GAGTAATGGT CAATCTTAAC 300 

ATTTTGTTTT AATTGTTTAA TAAAGCTGCT GGGCAGTGCTr QCAGCATTCC TACCTAGTGT 360 

50 CATAAAAGCA AAATACTTAC ATAGCTOTCT TAAAATATAG GAATGACATT ACATTTTTAG 420 

GAGAAACTTAA GTTGCTTTGC ACCGCCTACT TAATTCCTTT CCATATATTG TGATACAAAC 480 

TTTTGAATAT GGAATCTTAC TATTTGAATA GAAATGTGTA TGTATAATAT ACATACATAC 540 

ATAAGCATAT AJTGTGTGrrcrr GTGTOTGTAT ATATATATAT ATGCATGCTG TGAAACTTGA 600 

CTACACAACA TAAATCACTT TTTAAATTCC AGGAACGGGT AGTCTGACAC GGTGATTATC 660 

60 CmTGAGGC TGAATCOGTT ATTAACTTGT TATTTAGGTT TTACTCCCAG TAGCAAGGGA 720 



TTCTAAGTTA GITGCACTTA CATGATTATT GTCAITEAAA ACTAAGAATA AAGGCTGCAT 780 

TTTCAAAGAT AAATTGGAAT TOCTGTTQGT GAAATAACAA CCAAAATACT GAATCTCATG 840 

TACATACAGG TTTCTACAGG AAGAGA3X3GT ATAATTTACA ATTTOGAGAT TTAATAAOCA 900 

C3GGCTACCXA GAAAAAGTGA CITCATAACA TGGTACCAAT AAGTAAGGGA TGCTCTCTOG 960 

GTTTGCTTTT GCCACTTTCA AGATTTTAAC TTCTCAGGTT ATTAATCAAA ATTATTGTAT 1020 

AAGTTAGCCA ATAGAATTTT TAGGTTAAAA CAACAGATGG GGGGTrTGTG GACrTGnTTAA 1080 

TGTCAIGGGC ATTITTAGTA GCATAGACCC TTTGrTTCTGC ATTTGAATOT TTCGTATATT 1140 

TTTGTTTCAC AGTTAATCTT CCCTCCCCAA GTTTOCTATT CAAATCAACT GCCTGAATGA 1200 

CATTTCTAGT AGTCTGATCT ATTTTTCTGA GGAATAGrTTT GTQATrcCAA TQCAGGTOTC 1260 

TTCATTACCA TTACCTCTAC ACTGCAGAAG AAQCAAAACT CCTrTAITAG AATTACTGCA 1320 

CATGTGTATG GGGAAAATAG TTCTCAAAGG CTAGAATGAT ACAAGTGAGC AAAAGTTGGT 1380 

CAGCTTQGCT ATOGAGnGGT GGCAATAATC TCTAAACATT CCAAAAGACC ATGAGCTGAA 1440 

CCTAAACTCC CTTQQGAATC TGGAACAAAG GAATATGAAA ATTGCCAITr GAAAACTQAC 1500 

CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCCATTT CAAGTACAGA 1560 

AATTATAGAG ACTACAGCTA AATAAATTTG AACAITAAAT ATAAnTTAC CACTTTTTGrr 1620 

CTTTATAAGC ATATTTGTAA ACTCAGAACT GAGCAGAAGT GACITTACTT TCTCAAGrnT 1680 

GATACTGAGT TGACTGTrCC CTTAICOCTC ACCCTTCCCC TPCCCTTTCC TAAGGCAATA 1740 

GTIGCACAACT TAGGTTATTT TTQCTTCOGA ATTTGAATCA AAAACTEAAT CCCATGCATT 1800 

TTTTTCITrr GCAAGACaCC TGTrTATCAT CTTGrrTTAAA TOTAAATCTC CCCTTATOCT 1860 

TITGAAATAA ATTTCCTTTT GTAAAAAAAA AAAAAAAAAA AAAAAA 1906 



(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1935 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:
TCTGAACTAA TGCTAACAGA TCCCCCTGAG GGATrCTTGA TQQGCTGAGC AGCTGGCTQG 60
AGCTAGTACT GACTGACATT CATTGTGATG AQGGCAGCTT TCTGGTACAG GATTCTAAGC 120
TCTATGTTTT ATATACATTT TCATCTGTAC TTGCACCTCA CTTTACACAA GAGGAAACTA 180
TGCAAftGrrTA GCTQGATCXX: TCAAGGTCAC TTAGGTAAGT TGGCAAGTCC ATGCTTCCCA 240
CTCACXrrCCT caggtcagca agtctacttc TCTQCCTATT TTGTATACTC TCTTTAATAr 300
GTGCCTAGCT TTQGAAAGTC TAGAATOCX3T CCCTC3GTGCY TTTTTACTTT GAAGAAATCA 360
GTTTCTGCCT CTTTTTQGAA AAGAAAACAA AGTGCAATTG TPmTACTG GAAAGTTAOC 420
CAATAGCATG AGGTQAACAG GACGTAGTTN AGGCCTTCCT GTAAACflGAA AATCATAOtlA 480
AAACACTATC TTCCCATCTC TTTCTCAATG CCTQCTACTT CTTGrTAGATA TTTCATTTCA 540
GGAGAGCAGC AGTTAAACCX: GTIGGATTTTG TAGTTAGGAA CXTTOGGKTCA AACCCTCTTC 600
CACTAATTQG CTATGTCTCT GGACAAGTTT TTTrrTTTTT TTTnTTTAA ACCCTTTCTG 660
AACTTTCACT TTCTATGTCT AOCTCAAAGA ATTGTTOrGA GGCITCAGAT AATGCATITC 720
TAAAGGGTCT GCCAGATAGG AAGATGCTAG TTATGGATTT ACAAGGTTGT TAAQGCTGTA 780
AGAGTCTAAA ACCTACAGTG AATCACAAT6 CATTTACCXX: CACTGACTTG GACATAAGTG 840
AAAACTAGCC AGAAC3TCTCT TTTTCAAATT ACTTACAGC3T TATTCAATAT AAAATTTTTG 900
TAATOGATAA TCTTATTTAT CTAAACTAAA GCTTCCTGTT TATACACACT CCTGTTAITC 960
TGQGATAAGA TAAATGACCA CACTACCTTA ATTTCTAGGT GQG1X3CCTGT GATGCTTCAT 1020
TGTAGGTAAG GACATTTTCT YTTTTTCAGC AGCTGTGTAG GTCCAGAGCC TCTGGGAGAG 1080
GAGGGGGGTA GCSOOTVCCX: AGCAGGGGAC TGAACTGGGA AACTCAAGGT TCTTTTTACT 1140
GTGGGGTAGT GAGCTOCCTT TCTGTGATCG C3TTTCCCTAG GGATC3TTOCT GrTTCCCCTCX: 1200
TTGCTATTCG CAGCTACATA CAAOGTQGCC AACCCCAGTA GQCTGATCCT ATATATGATC 1260
ACSrrOCPGGTG CTGACTCTCA ATAGCCCCAC CCAAGCTGGC TATAGGrTTTA CAGATACATT 1320
AATTAGGCAA CCTAAAATAT TGATGCTOCT CTTTGGTGTCA CATAATQCTA TOG(XAGAAC 1380
TGAAACTTAG AGTTATAATT CATCTATTAG GGTTCTCCAG AGGGACAGAA TTAGTAGGAT 1440
ATATGTATAT ATGAAAGGGA GCTTATTAGG GAGAACTC3GC TCCCACAGTT AGAAGGOGAA 1500
GrrCQCACAAT AGGCCX3TCTG CAAGCTGGCST TAGAGAGAAG CCACTAGTOG CTCAGCCTGA 1560
GTTCAAAAAC CTCAAAACTG GGGAAGCTGA CAGTOCAGCC AGCCTTCAGT CTGTGQCCAA 1620
AQGCCAAGAG CCCCTGCXAA CCAACXX:ACT GGTOCAAGTC CTAGATTCCA AAGGCTGAAG 1680
AACCTGGAGT CTGATGTCCA AGAGCAGGAA GAGTCGAAGA AAGCCAGAAG ACTCAGCAAA 1740
CAAQGTAGAC ACnCTCTftCC ACCAyAGTGG CCATACCAAA GAGQCTACCG ATTCCTTCCT 1800
GCTACCTQGA TOXTGAAGT TGCCXTPOGTC TCTQCACCTT CTAAACCTAG TTCTTAAGAG 1860
CnrOCATTA CATGAGCTGT CTCAAAGCCC TCC7ATWAAT TCTCAGTGTA AGYTTCAAAA 1920
AAAAAAAAAA AAAAA 1935

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1446 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

NGCCCCCTTG GCACAAGTCA GATGAAGCAC GmCTGCCGG GGAGGCCCTC AMCTTCCAGA 60 

15 

GAGGACAGAC ACAGATTTCC TGCTGQQQGA GGGAGGAGTC CACGCATCCT GATGCTQCCT 120 

GGAAGCTTAT TTTCCOGTGG CCAGGATGCA TTTCTCTGAG TQGAAACAOG TrCTTGCATG 180 

20 TQGATGTGTG TOTOCCCAGG CAGACGGCCC CTCmTCCC AGCACTTCCC TQCCTCCCOC 240 

AGGCCTCAGG CCAGCACCCA GTTCCTCCTC ACATQGCAGG TGAGCACAGA CTTCTAGmG 300 

GCAGGAGCTG AGGAGGGTGA ACAAACCCCG AGGGAGGCCC GGCCCTTGCT COCGAGTTGG 360 

25 

GGGGAGOGGG TGTTGGCAACG TQCCCCCCGC AGAGGCCACG CATGTTTGAC CAAAGCCCTC 420 

ATTGTQGrrCC GAGGACAGCC TTTTCCCCAG GCCTCARAGC ATTGCTCATC CGTGCCAAAC 480 

30 TGGGTAGGTG 6ATTTGAGCG GAAAGACTCC CAAAATGTGC CAAGAATTTC CCRGTCCCAG 540 

GCAGGGCAGG GGAAACTAAG GGCAAGCAGG ATACAGGGCG AGGGATGTQG CAGGTCAGGG 600 

GGCTOCCGCC TGTGCCOCTT CTCCTCACCA TGTCTCCCCC ACCCTGCCTC AGrTTCTCOGT 660 

35 

TCCCCTTCAT CTCCX3TCCCC CTCTTTGAAG CTGTCCCCAT CTCAGTGTCA GACCAGCCTT 720 

CTCCTCAKCT GACCACCCTC CTCTGACCSA CGCCCCCTCC TTGTCTGAAA AAAGGAGCCT 780 

40 TGAATGGTGG AGGGAGGCAG TGGGGAGAAA GGTCTCACCG GACAGGTTQG GAGAATGAGG 840 

TCAGOGGTGC TGGGGAACAG ATOGAGGOGG CAGTGGGGAC AGGGCTTGGG CAGACACCAG 900 

CAGGAATAAT TTGAAATGTG TGAGGTGACT OCCCGGAGGC CTTGGGCTTG QGCATTTQGG 960 

45 

AAAAGAATGA TGTCTGGAAG OGCTTAAOGG ACACAGTGGA CGAGGGGAGA GTCCTCATCT 1020 

GCrGGCATTT TGTGGGGrTGT TAGTGCCAAA CTTGAATAGG QGCTGGGGTG CTGTCTTCCA 1080 

50 CTGACACCCA AATCCAGAAT CCCTGGTCTT GAGTCCCCAG AACTTTGCCT CTTGACTGTC 1140 

CCTTCTCTTC CTACCTCCAT CCATGGAAAA TTAGrTTATTT TCTGATCCTT TCCCCTGCCT 1200 

GGTCTAGCTC CTCTCCAAAC AGCCATGCCC TCCAAATGCT AGAGACCTGG GCCCTGAACC 1260 

55 

CTGTAGACAG ATGCCCTCAG AATTGGGGCA TGGGAGGGGG GSTGGGGGAC OCCATGATTC 1320 

AGCCACGGAC TCCAATGCCC AGCTCCTCTC CCCAAAACAA TCCCGACAAT CCCTTATCCC 1380 

60 TACCCCAACC CTTTGCGGCT CTGTACACAT TTTTAAACCT QGCAAAAGAT GAAGAGAATA 1440 
TTCTAA 1446



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

TTTTTTTTTT TTTGATATGA AATTGTCTTT CTCCATTGCA GAAATAAGCT AGGGAAACAC 60 

TAACCCAAAA ACTTTCTGTA GAGCTGTTCC TTTGGAGGCA GCATCACTTA TTOQCAGTAA 120 

20 

AGACTCAGTA TAAAAGCACC AGCATCCCTA CTTGGGTCAT GGGGATTAAT TTTATAGCAT 180 

TCCAriTTCC TAGTCCCACA TGrTGAAATTG GATTTTGATG ATCTTAATCT ATATTCTACC 240 

25 CTTATAATAA AAGATCAAAA GATATATCTC CTATGAACAG ATTOGAGATA OGAGATGAAA 300 

AGTTQGGAGG ATGTCTTTAT TCTAATCrTGA GGGTAGQGAA AATGTGGATA ACATTACTGG 360 

GGTGARGGAG GCATrGTTCT TTAGrTTGGAG TTCTCATTTT TATTCTCCAG TACTGACTTG 420 

30 

TGGGGAAAGC ATACTTTTTC ACTGCCAGGT ACTGAATGCA GAGGCTCAGT GAAGTTATATA 480 

TGTGQGAAGT GCATGCATTT CGTTTATTAG CAAACATAGC TGGATTAAGA CAAAGTTGTT 540 

35 GC?rTTGGAAA GGGGTTAAAG CCTTAAGTGA ACAAATCTAG CTAACAGTGA ATGAACTAGG 600 

TAATATAACT TGCATATTTT TAATTTCCTT TGGTTAAAGG TCCCCCATAC TTCTCTGrTPC 660 

GGAGACATGA GAAGTATGAT TACTTCAGTG TTAGTTITCT TAATTTTTTT TTTCCCCTAT 720 

40 

TTGTCCCTTG TCACTTTGTT GCAAGCTAGA AATCTGTOGG TTATACATAG GGCAGCTCTT 780 

TCTGAAACTG GTTTATTCCA CTGGAGAAAG GGGATTGAAA ATCAGnTAGA ACCAATGTAT 840 

45 TTCTTGCCCC AOGGAACACT ATTCCTATAA GATAGCTGAA AGAAGCTGCT GTGAGGAGCT 900 

CAGCTCCAAA CACAGGATCA GCACCTTGTA TAGGAATTCC CATGAATTAT GACTTCTCAT 960 

TCTGTTTTAT CAGAGTGCAT ATATGTCCTA CTTCAGGAAA AGTAAAACAG TCATTTACGA 1020 

AAGAAACTCA ATCTGTATCC TAAGCATTTT AATAAAAAGT TAAAACAAAA AATTAAAAGG 1080 

GACACTCGAG GGGGGGCCCG AAACCCAAT 1109 






(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 497 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:
TAGGACTAAC TTAAATTCTT TTATTCATCT TrTATTTATT AAAAAATTTT ATTTCTTTGA 60 
10 ATTTTCCTGT AATTTCCTTA RGCTCTTCTA TAAAATGrTTA TATTCATGTG AACCATACCT 120 
CATTATOCTT AACATTTACT CTCAAAAAGC TTTTTATTTT TATTTTTTTG AAGGTAGTTT 180 
TTCTGTGTGT ACTCrGTAAC ATGATTTTGC TTTCAAATCA TTGTTCTGCX: CXX:ATACAAA 240 

15 

ATOCCTTTTA TTTTTGAGGA TCGTQGACTT TTTAGTATGG CATGAGTGTG CTAAAAGCCA 300 
GATATCTTTC CACATTCACT GGTQGCTTTG ACACCTAGTT TTTAATCTCC CATCCTTACT 360 
20 TTAAACCCTG ACACTTGCAGT CCTCAGTCAG GCXXAGGACC GGGCTGAGGC CCTTTGTC3GA 420 
GATGCTGCAC CACCAGCAGA AGGCTGAGAC CTGGTTACCT GTACCTGriTC ACTTGTAATA 480 
AAAAGAATTA TCTAAAA 497 




(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

ATGAGGCAGA GGCAAGCTGC CTQCCAACCC CCTCCCTCAA GGAATGGCCT TQCCCAGGAA 60 

TGCCCACCAC ACATACOCTC TTCTmTTT CTAGTCAAAC TCTTGTTTAT ' I X .VrilXj L' Tr 120 

GCCTCCCTCC TTTCCTOCCC TCTCAACCTT TTACTTCTGG TTTCTATTTC ATGGGATTTG 180 

45 GGGTTGAAGT TAAACTTACA ACAGTCCCGC CAACACCAAG TCTTGCAGGA AAAAAATACA 240 

AAGAAATTTA ACAAAAAAAA AAAAAAAAA 269 



50 



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1269 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

TTGaVTTGACT ATGGTCTCTC OGCXTAOCAG GAftGAGTCTC CX3GAAGTGAA QGCCATOSAC 60

TTCATCACCT CCACAGCTAT CCTGCXXXTTG CTGTTCGQCT GCCTQGQCGT CTTCGGOCTC 120 

5 

TTCCXSGCTGC TQCAGrTOGGT GOQCGGGAAG GCCTACCTC3C GGAATGCTOT QGTOGrPCSATC 180 

ACAC3GCGCCA CCTCAGGGCT GGGCAAAGAA TGTOCAAAAG TCTTCTATQC TGOGGOTOCT 240 

10 AAACTGGrroC TCTGTQGCCG GAATQGTQGG GCXXTAGAAG AGCTCATCAG AGAACTCACC 300 

GCTTCTCATG CCACCAAGGT GCAGACACAC AAGCCTTACT TOGrTGACCTT OGACCTCACA 360 

GACTCTQGQG CX:ATAGTTGC AGCAGCAGCT GAGATCCTQC AGTQCTTTGG CTATGrTCGAC 420 

15 

ATACTTGTCA ACAATGCTGG GMCAGCTAC OGTOGTACCA TCATGGACAC CACAGTQGAT 480 

GTGGACAAGA GGGTCATGGA GACAAACTAC TTTGGCCCAG TTGCTCTAAC GAAAC3CACTC 540

20 CTGCCCTCCA TGATCAAGAG GAGGCAAQGC CACATTGrrCX; CCATCAQCAG CATCCAGGQC 600 

AAGATGAGCA TTCCTTTTCG ATCAGCATAT GCAGCCTCCA AGCACGCAAC CCAGGCTTTC 660 

TTTGftCTGTC TGCXTTOOCGA GATGGAACAG TATGAAATPG AQGTGACCGrT CATCAGCXXX: 720 

25 

GGCTACATCC ACACCAACCT CTCTGTAAAT GCCATCACCG CGGATGGATC TAGGTATGGA 780 

GTTATGGACA CCACCACAGC CCAGC3GCCGA AGCCCTGTGG AGGTGGCCCA GGATGTTCTT 840 

30 GCTTGCPGrrGG GGAfiGAAGAA GAAAGATGTC ATCCTGGCTG ACTTACTOCX: TTCCTTC3GCT 900 

GTTTATCTTC GAACTCTGGC TCCTGGGCTC TTCTTCAGCC TCATGCCTCC AGGGCCAGAA 960 

AAGAGCGGAA ATXXAAGAAC TCCTACTACT CTGACCAGCC AGGGCCAGQG CAGAGAAGCA 1020 

35 

GCACTCrrAG GCTTGCTTAC TCTACAAOGG ACAGTITOCAT TTGTTGAGAC TTTAATGCaG 1080 

AITTGrrCTCA CAAGrrOGGAA AGACTGAAGA AACACATCTC GrrGC3«3ATCT GCTGCXAGfiG 1140 

40 GACAAOCAAA AACGACAACA ACXOTCITCC CAGGGrTQAGG GGAAACACTT AAGGAATAAA 1200 

TATGGAGCTG GGGTTTAACA CTAAAAACTA GAAATAAACA TCTCAAACAG TAAAAAAAAA 1260 

AAAAAAAAC 1269 

45 
(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1944 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

AAAAGGCAAA CTATAGGATA ACACAGAGCC CTTTTTGAAA ATAAATTGGC ATTGGAGTGT 60

TTTACCCTCT AQCTGTTTTA CTTAGAATGT AACATATGCT GCCEACOCAC CTCAAAATGT 120

CTGTACTGCA AGAGQGCCCT GGGCCTCTQC TTTCCATATT CACGTTrOGC CAGAGTTGTA 180 

GTCCCAAAGA AGAGCATC3GG TGGCAGATGG TAGQGAATTG AACTGGCCTG TGCAAITGQGC 240 

ATOGAGCACA AQGQGTCACA GCATC3CCTCC TGCCTTACCG TQGCAG1ACX3 GAGACAGrPCC 300 

AGAACATGGT CTTCrTGCCA OJUXTIXJI' TC TTGTCTCTGG TOGTOCTGCA TGTCTGTGGC 360 

TCACCTTTAT TCTTGAAACT GAC3GTTTACC TOGATCTQGC TACTGAGC5CT AGAGCCCACA 420 

GCAGAATGGG GTTC3GGCCTG TGGCCCCCAA ACTAGGGQGT GrPGGGTTCAT CACflGTGTTG 480 

CCTTTTGrrCT CCTAAAGATA GGQATCTACT TTTGAAOGGA ATTGTTCCTC CCAAATAAAT 540 

■ rXUCiTr ftCC TTGGTCCrTT CTTTTGTCCC AGTMTCAflG TGGTATAGCT CTGflGCflGGG 600 

TCACATTTGG CCAAAOCTGA CACTGTCTTG CTGCATTCTC CTTTGGCAAA CATCAGGC3TC 660 

AGAATTCAQG ATAGCCCTTC CTAGQGCACT GGACTTTCTG GCATGGGGGC TC7ICTTTGCA 720 

CAAGTTATTT TCATGTCACC TGGASAGTGT OCAGAGGCTG CTCTGAGGCT GAGGTCTGrTT 780 

CCCCCrroCC TQGTTCCAGC TCrCAGftGOG ATACCATCCT ACXSGTCTOGG AATCCAft3GC 840 

CACGAGACTC CrrGGTTTGT GGTCCGftGAT CCTGTACTAA GGAQGGTCTG GCCflGAGGAA 900 

CAGACCAGCT TTTGCACAAT GAAGCGCAAG GGAACAftGTG GTTTGCCTGG TOTCCTACCT 960 

GTCCTGAACC TQGrrcCTGTG GGCCATTGAA AAGTTAGATC TGTGATCTCT GCXXTTTTTTG 1020 

TGGCTTTGTT CAATGCTTCC ACTCTAGGGC AGGCAGAGCA GTCTATACTC TCCCAAGCCT 1080 

GCTTGACOTC CAAGTAGAGC TGATACAGAG ATCTGTGAAT ATrGTCATAG AAATTCTTTG 1140 

GTATTCATAC ATTTCAGCTG CAAGTCAGCA ATITCCCAGG TACCATOTAA GCTATAAAAC 1200 

AGTCATTCTT AAAGACAGAG GATAGCTGTG ACTCATGGGA TCATGAGGTC CATGCSCTQGrr 1260 

TGCAQGrrrcc ctttttcctt cxncAGGrrrr tgtctcttcc rcrrcnnarcc ccagcaaggg 1320 

AGAGACTGTG GGGTGGATTG GGAGAACAGA TTAGGAGTAT AGCAAATGAA CCCAGAATGG 1380 

AACAGTQGGG AGCTAACTGT GAATGAGGAG ACTACCTQCT GCAGGACCTG GAGGTCAGGT 1440 

GTXy^TGCTG TATTGGCACA GGGAATAAAT ATCCTGGCGT CTOGAGCCTT CACCTCTCCG 1500 

TCAAGTCCTT CCTGrTGATAC TGCCATGGCA CAGGATCTGA OrTOCAGCTC TGCACCCTAA 1560 

ATCACAOCCT GGGCATTC5TC TQGGCTGCAG GGCTGCCAGG TTCTGTACTr GTGTCCAGCT 1620 

GTGGCCCTGG ATGCTOGAGC TGGAGCX?m* TCTG?rGCTCA GACTGrTAGCX: TCTAGCTCTT 1680 

GGCCTGrrGTA GAGCCCCTTC CIOTGCCCTC AGrrGCSCTGTC GmTGTTAAC ATCATCAGGA 1740 

AGATGQGAAA GGTCAGGCAG AATTTTTCTG CCCTACAAAG QGTGGAAGAG AAAGGACACA 1800 

GTATTTTCAT GAATTTACCA TATATCTTTG TTTTTCTTCA ACGAAAAAGT TAATTGAGGC 1860 
AATGrrCATCT GCTCAAAGTT GAGTQGTTTA TTCACAATAA ACTGTAAGTT TCTGATTATA 1920
AAAAAAAAAA AAAAAAAAAA AAAG 1944 



(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1021 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:
TCGACCCACG OCJrCCGGCSGT GCGCAAOGGG GACTTTCCGGC TQGAGACCCG TtSCTCTGQGC 60 
20 CQOOQCCTTC ACCATQGCCT OGGCAGAGCT GGACTACACC ATOGAGATCC OQGATCAGCC 120 
CTGdGGAGC CAGAAGAACA GCCOCAQCCC AGGTQQGAAG GAGGCAGAAA CTCGGCAGCC 180 
TGTGGrTGATT CTYTTGGGCT GGQGrPQQCTG CAAGGACAAG AACCTTQCCA AGTACAGTGC 240 

25 

CATCTACCAC AAAAGQQC3CT GCATOGTAAT CCGATACACA GCCCCGTOGC ACATGGTCTT 300 
CTTCTCOGAG TCACTGGGTA TCCCTTCACT TCGTCTTTTG GCCCAGAAGC TGCTCGAGCT 360 
30 GCTCTTTGAT TATGAGATTG AGAAOGAGCC CCTQCTCTTC CATGTCTTCA GCAAOGGrPGG 420 
CGTCATGCTG TACCGCTACG TQCTC3GAGCT CCTGCAGACC CGTCGCTTCT GCCQCCTC3CG 480 
TGTGGTQGGC ACCATCTTTG ACAQCGCTCC TQGTGACAGC AACCTQGTAG GGGCTCTGOG 540

35 

GGCCCTGGCA GCCATCCTGX5 AGCGCCGGGC CGCCATGCTG CGCCTGITGC TCXTTOGTOGC 600 
CTTTGCCCTG GTGGTCGTCC TGTTCCACCT CCTGCTTQCT CCCATCACAG OCOTCTTCCA 660 
40 CACCCACTTC TATGACftGGC TACAGGACGC GGGCTCTCGC TQGCCOGAGC TCTACCTCTA 720 
CTCGAGGGCT GACGAACTAG TCCTQGCCAG AGACATAGAA CGCATGGTGG AGGCACGCCT 780 
GGCACGCCGG GTCCTGQCGC GTTCTGTOGA TTTCGTGTCA TCTQCACAOG TCAGCCACCT 840 

45 

COGrrGACTAC CCTACTTACT ACACAAGCCT CTGTGTCGAC TTCATGOGCA ACTGCGTCOG 900 
CTC5CTGAGGC CATTGCTCCA TCTCACCTCT GCTCCAGAAA TAAATQCCTG ACACCTCCCC 960 
50 ACAAAAAAAA AAAAAAAAAA ACTOGAGGGG GGGCCOGGTA CCCAATTCGC CCTATAAAGG 1020 
T 1021 



55 



(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1285 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

GGCAOGAGGA GQCSOCAOSGC AGCXATOGCG CTTTQCAGTT OGGnCTCCTG GTGrrACQQCC 60 

AACGCCAAGT AGGGGATTCC GTTCCCTCCA GTCQCAGACC CTATCAGATT TGGATAICTC 120 

CTTCOTATTT GA3TOGATTT ACAGTOGTTT CAGCAGTGTG CTACAGTTTT TAGGATTATA 180 

TAAGAAAACT GGTAAACTGG TATTTCTTGG ATTGGATAAT GCAGGAAAAA CAACATTGCT 240 

15 ACACATQCTA AAAGATGACA GACTTQGACA ACAICTCCCA ACATTACATC CCACITCCGA 300 

AGAACTGACC ATTQCTGGCA TGAOTrTTAC AACTTTTGAT CTGGGTQGAC ATGTTCAAGC 360 

TOGAAGAGTG TGGAAAAACT ACCTTCCTQC TATCAATOGC ATTGTATTTC TGGTGGATTG 420 

20 

TGCAGACCAC GAAAGGCK7P TAGAGTCAAA AGAAGAACTT GATTCACTAA TGACAGATGA 480 

AACCATTGCT AATGTGCCTA TACTGATTCT TCGGAATAAG ATCGACAGAC CTGAAGOCAT 540 

25 CAGTGAAGAG AGGTTGCC3AG AGATGmTGG TTTATATOGT CAGACAACAG GAAAGQGC5AG 600 

TATATCTCTG AAAGAACTGA ATGCCCGACC CTTAGAAGTT TICATGnCTA GTGTGCTCAA 660 

AAGACAAGGT TACGGAGAAG GCTTCCGCTG GATGQCACAG TACATTGATT AACACAAACT 720 

30 

CACATTGGTT CX3VGGTCTCA ACGTTCAGGC TTAOTCAGAG ATTTGATTGC TCAACATGCA 780 

TAACTTGAAT TCAATAGACT TTTGCTGGrTT ATAAAACAGA TGTTTTTTAG ATTATTAATA 840 

35 TTAAATCAAC TTAATTTGAA TGAGAATTGA AAACTGATTC AAGTAAGTTT GAC3TATCACA 900 

ATGTTAGCTT TCTAATTCCA TAAAAGTACT TGCTTTTTAC AGTTTATAAT CTQACATCftC 960 

CCXAGCGCCA TTTGTAAAGA GCAACTTTCC AQCAGTACAT TTCAAGCACT TTTTAACAAC 1020 

40 

ATGAAACTAT AAACCATATT TAAAAGCTCA TCATGTTAAA TTTTTTATGT ACTTTTCTQG 1080 

AACTAGrmT TAAASTITAG ATTATATGTC CAOCTATCKT AAGTGTACAG TTAATAATTA 1140 

45 GCTTATTCAA TGATTGCATG ATGCCTTACA GTrTTCAATA ACTTTTTrTC TTATGCAAAC 1200 

GTCATGCAAT AAAACAAACT CTAATCTTTTG GCAAAAAAAA AAAAAAAAAA NTCGAOOGGG 1260 

GGCCOGTACC CAATTCQCCC TAAAG 1285 



50 
(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1386 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

GCSCAOGftGGT GGCGCAGGGG TGAGTGGTTC TCTCGGGTCT OGCXftCAGGT GAGCACCCTG 60 

5 

ATGAAGQCCA CX3GTCCTGAT GOGGCACCTG GQCGQGTGCA GGAGATCGTG GCSOGCCCTCC 120 

GCAAGGGCGS CGGAGACCGG TTACftGGrTGA TTTCTGATTT TRACATGACC TTGAGCAGGT 180 

10 rrQCATATAA TQGAAAGCGA TGCXXnTCTT CTTACAATAT TCTGGATAAT AGCAAGATCA 240 

TCAGTGAGGA GTGTCGQAAA GAGCTCACAG OGCTCCTTCA CXACTATTAC CXi^ATTGAGA 300 

TCGACCCACA CXX3GACCX3TC AAGGAGAAGC TACCTCATAT QGTGQAATGG TGGftCCAAAG 360 

15 

CXSCACAATCT CCTATGTCAG CAGAAGATTC AGAAGTTTCA GATAGCCCAS GTOGTTAGAG 420 

AGTCCAAPGC AATOCTCAC3G GAGQGATATA AGACCTTCTT CAACACACTC lACCATAACA 480 

20 ACATTCXXXT TTTCATCTTT TCTGCGGGCA TTOGrTCATAT CXTPQGAAGAA ATTATCCGAC 540 

AGATGAAAC3T GTTCCACCCC AACATCCACA TCGTGTCTAA CTACATGGAT TTTAATGAAS 600 

ATGGrrrrrcT CCAGQGATTT AAGQGCCAGC TGATACACAC ATACAACAAG AACAGCTCTG 660 


TGTCTGftGAA CTSTGGTTAC 1TCCAGCAAC TPGAGGGCAA AACCAATGTC ATCCTGCTGG 720 

GAGACTCTAT CGQQC3ACCTC ACCATGGCXX3 ATGQGGTrCC TGGTGTGCAG AACATTCTCA 780 

AAATTGGCTT CCTGAATGAC AAC3GTQGAC3G AGCC3GOGGGA NOXniACATG GACTCCTATG 840 

ACATOGTGCT GGAGAAGGAC GAGACTCTTOG ATGrTGGTCAA CGGGCTACTG CAGCACATCC 900 

TOPGCCAGGG GC?rcCAC3CTG GAGATGCAAG GCCCCTGAAG GCGCAC3GCTN CCAGNCCGCC 960 


TGCAGGCCGT GGTGAGGAGG GGOGCCTCCC CAGAGTCTGC TCCCCCGTGA ACACAGBGCA 1020 

GANQCCftGGG TQGCXAGCAG TGGCTGGGTC CTTCCGCXXX: (XTCCGTCCT CCTTTCOCTG 1080 

AGCACCTTCA TCACCAGAGG CTTGAAC3GAA CCCCQCCATG TGGCAQGGCA CAGGCACTOP 1140 

TCCTQGTGAA CCTTGGACCA CAGCATGTCA GrPGCTCTAGG GATTGICTAC TCCAGGGATT 1200 

TTCTTCAAAA TTTTTAAACA TQGGAAGrTTC AAACAAATAT AATGTGrTGAA ACAGATCAAA 1260 


ATTTTTAAAA TGAAAAAAAA GCTGCTCTGA ITCAGGGGAT GTGGGTCGGG GTAGAACCTG 1320 

GACCTCTTGG CCTQGGQGCA CATGGGATGC TTCTAGGAAC ACAGTTTGAG AACCACCAAA 1380 

AAAAAA 1386 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AGCCCTTCTC CCXXSCGCTTG GGACPCTGAC ATCTTAAGGC TQCAOGGTCG TGrrCCTTGTC 60 

TGGGTGAGGC CATGTCTGTG ATCCAAGGTT CXTOGAACTG ACACAGGAAG GGGCTGrPGAA 120 

CCXTAAGrroC GTGTMMCTC CTCC31ACCGA GGCTTCTMAC OCTGGAGATG GCACHTACTC 180 

CTQGOCATQG TTGCTGAGCA TGGQCAGACC AGTOQAGGCX: ACCCTACTGT GrTTATCTQCXS 240 

CCTTCRATGA AGrTGAGACCC TTOGGGAGAA CGQGCTGTGG ATGAAGGAGT GGACTQCAGC 300 

CTTGGCCTAG CXrACTGGCXT GQGATCTTCT GOGTPCAICTG ACTGaCTATC CAGGAGCAGA 360 

AACTTGrnATT CTCAGGATTC AOGATCTACC CAGCACCAAA GATGTATTTr CAGGAGAACA 420 

GACCTAGAAA TGGGCCTC3TC TGGCATTTCA GAGTCA3GCA AAGCAGGCAG GGCCAGGGAG 480 


CTTCTGrrOQG TCTACACAAG AAGGTrCCTG TGAGGGCTAT CAGTTGTTGC CTTCTAGCTr 540 

GCrOGTAACT TTGGCGCCTC CGCCAAGCCC TGCCAGACTC CCCTGGCTGrT GAIXSGCKTIC 600 

TGTQCCATCC TGCCTTGrTCC CCAGCCTCTG CAQGATGCCC TCCCTACCCA MCTYTYCCTG 660 

GOCCTTCXXT GTCCACTQGG CTOGATTCAT GTTCAAACCA CTGGACTQGC AGGGCAACGA 720 

CTTCTTCCCA CCTCAAGATG AGGTCCTCGC CCCCTTGTCT TGGCATAAAA ACACCTTTAA 780 


AGCATGAGCC ATGTCCTTCT TTGCXXTITCT CT3TCCTGTT CXAATCTTCT GCCTCCCAGT 840 

CACTCCCTQG GGACTATGGG ATCACTGTCC CCCCACCTGT GTGGCCACAC CATGTGTCCT 900 

GTCAATCCAG AACTGCCTCT GAGCTCCAGG CTGACCACAG ATCAGCCACA GCCTGATGCC 960 

TQCAGCCCCA CTTTGCTCAC CCTTCCXETC CXXTICCTCCT TCCTTCCACA CAGCAAGCCT 1020 

ACXrrTTVTCC ATCCATGCTC AOCATAGCCC CCTTCCTTGrT GACXTOGACX: CTCCATTGTA 1080 


CCTGGCTGAG ACTGTCAGCC TCCTQGAGGA GriGGGGTCCA CCTTCrPCTT GCXXTATGCA 1140 

GTGCAAGCTT CACTTCTCAC CCAGCAAGGT TGACTCATCT GCXrTCCATGrr CTCTGGGGCT 1200 

TTGCTGTTGC CCTGAAACCT AGCTQGGCTG GTCTPGCTCC CaGCTTQCTT CXXXXTCCTC 1260 

QGATGTCCCT TTGCAGGCCC CTGTCGITCC TCCGGCACCA GTGTCCTTGG CTGCCATGGC 1320 

AAGCnCATCA GQQGCTTGTA (XXnGCTTCAC CAAGCATQGT AGCAGCTGCC TGCATTGTAT 1380 


CTCCATCTGG TCACTGCAGG TGCCT^ACCCT TCATOXCCA TGTTTTCCTG GGCCATGGAG 1440 

GGCTGACCTC CGTrPCTGGG GAATGTOGCT GAGCTGrPGGT AACCAGCTAC ACCCCAGGTC 1500 

CTCTTTCXAT GGTGGTGCCT GCTCATCTTG CTGAITGCAAA CTAGGAAGTT AGGCTGCATC 1560 

TCGGAGTGGC TTTCGCTQGA GAGGrTGCTTT GCTGTCTCTC AGACTCAGTC ACTGTGTTCr 1620 

CTCCCCGCCT CTCTTATCTC CATGQCTGTT TGCAGCTCTC OCAGGTACTT TGGGGTCTGA 1680 

CXTQGAATTC CmtTiWrr TGCTCTTCTG CTTCTCACTC ITCTATTAAG AAGGATTCCA 1740 

CAAAGOGAGA GTQGCATCCC TGCTC3CTGCT GTGCXIAGACC AGAffnTCCT GAQGGGOCCT 1800 

GACCCTAACC CTCCAGCTCA GCCXTOTACA CCTGACCCTG TAAATGACTC GGGTTTGCTG 1860 

ACTGTAATCC CTGACAOCAG TAAAACCAAA AC3GACTCTTG GGGGCTCAGT GTGAGAGCCA 1920 

GGGnTACCTA CTCTGCCAAG TGAGGACAAA CTGCTA13GCT GTATCCCATA ATTTCAOGAT 1980 

GAGAAACAIT AACAATAAAA ATTTGTAGTA AACATAACCT CATGANGACT AAAAAAAAAA 2040 

AAAAACTYGG GGGGGGGCCC GTAACCCATT GGGOCCTTNG GGGGGGNGriT TTAAAATT 2098 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

TCGACCCACG CGTCCGAACT GAGQCGGCGG CGGGAGCCGG TTGGKGTCTG GTCTTCGOGT 60 

CGOCCCCGCG GACCAGACGC TGCCCXXX3GC GCGGGGAGAA GATGGTOCXK AGCX3GCCTCG 120 

GGCCCX3CCAC GOGCCGCCAC GAGTGAGCCC AGOGCGACCG CGOGCGTCCG CCGAGCAGCT 180 

GGCCCGGCTG GGCCCGGGGC Ga3CAmX3CC CGCCGGGGCG GQGTGGAGCT GATCAGAATA 240 

ATCmCAGCA TCAACCCCCT QGAGAACXTTG AAQGrTCTACA TCAGCAGTOG GCCTCCOCTG 300 

GTGGTCTTCA TGATCAGCGT AANQCCCATG GCCATAGCTT TCCTGACCCT GGGCTACTTC 360 

TTCAAAATCA AGGAGAITTAA ATOXCAGAA A3XX3CAGAQG ATTGGAATAC TTTTCTGCTA 420 

CGGTrCAATG ATTTGGACTr GrrGTGTATCA GAGAATGAAA CCCTCAAGCA TCTCACAAAC 480 

GACACXIACAA CTCCQGAAAG TACAATGACC AGCGGGCAGG CCCX3AGCTTC CACCCACTCC 540 

OCCCAQGCCC TGGAGGACTC GGGCCOGGTG AATATCTCAG TCTCAATCAC CCTAACOCTG 600 

GACCCACTGA AACXXTTTCGG AGGGTATTCC CGCAACX3TCA CXXOTCTGTA CTCAACCATC 660 

TTAGGGCATC AGATTGGACT TTCAGGCAGG GAAGCCCAOS AGGAGATAAA CATCACXinTC 720 

ACCCTGOCTA CAGCGrTGGAG CTCAGATGAC TQCGCCCTCC ACGCJTCACTG TGAGCAGGTG 780 

GTATTCACAG CX7rGCA3X3AC CXTCACQGCC AGCCCTGGGG TGTTCCCCGT CACTGTACAG 840 

CCACCGCACT GTOTPCCTGA CACGTACAGC AACGCCACXX: TCTGGTACAA GATCTTCACA 900 

ACTGCCAGAG ATGCCAACAC AAAATACGCC CAAGATEACA ATCCTTTCTG GTGTTATAAG 960 

GGGGCCATTG GAAAAGrTCTA TCATGCTTTA AATCCCAAGC TTACAGTGAT TOnCCAGAT 1020 
GATGACC3GTT CATTAATAAA TTTOCATCPC ATOCACACCA GrTTACTTCCT CTTTGTCSlTG 1080 

CrrGATAACAA TGrmTGCTA TGCTGrTTATC AAGOOCflGAC CTAGCAAftTT GCGTCAGAGC 1140 


AATCCTGAAT TTTGTCCCGA GAAGGTOGCT TTGGCTGAAG CTTAATTOCA CAGCTCCTTG 1200 

TTTTTTGAGA GAGACTGAGA GAACCATAAT CCTTGCCTGC TGAACCCAGC CTGGGCCTOG 1260 

ATQCTCTGTC AATACATTAT CTTGCGATGT TQGGTEATTC CAGCXAAAGA CATTTCAAGT 1320 

GCCTGTAACT GATTTGTACA TATTTATAAA AATCTATTCA GAAATTGCTC CAATAATQCA 1380 

OGTGCrrrGC CCTGGGTACA GCCAGAGCCC TTCAACCCCA OCTTGGACTT GAGGACCTAC 1440 


CTGATQQGAC GriTTCCACGT GTCTCTAGAG AAGGATTCCT QGATCTAGCT GGTCACGACX3 1500 

ATGrrTTTCAC CAAGGTCACA GGAGCATTGC GTOGCTGATG GGGTTGAAGrr TTGCrnTOGrT 1560 

TCTTGTTTCA GCXXZAAtATG TAGAGAACAT TTGAAACAGT CTGCACCTTT GATACGGTAT 1620 

TGCATTTCCA AAGCCACCAA TCCATTTrGT GGATTTEATG TGTCTGTGGC TTAATAATCA 1680 

TAGTAACAAC AATAATACCT TTTTCTCCAT TTTGCTTGCA GGAAACATAC CTTAAGTTTT 1740 


TTTTGTTTTG ' ITlTlVriTn ' ' lT l X> n'iT iTr GmTCCTTT ATGAAGAAAA AATAAAATAG 1800 

TCACATTTTA ATACTACCAA AAAATGGACA AAAAAAGTCG AGGGGGG 1847 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
GACGCTGACG AGAGAAGGCC TCTTCCTTGA GGGTrOGTQC TGTGrrTQCAG TGAOCGTOGC 60 
GGATTAOGCC AACTCGGATC CGGCGGTCGT GAGGTCTGGA OGAGTTCAAGA AAGCCGTAGC 120 
CAAOGCTGTT CAGCAG6AAG TAAAATCTCT TTGTGQCTTG GAAGCCTCTC AGGTTCCTGC 180 
AGAGGAAGCT CTrPCTGGGG CTGGTGAQCC CTGTGACATC ATCQ^CAGCA GTGATGAGAT 240 


GGATGCCCAG GAGGAAAGCA TCCATGAGAG AACTGTCTCC AGAAAAAAGA AAAGCAAGAG 300 
ACACAAAGAA GAACTQGACG GGGCTGGAGG AGAAGACTAT CCCATGGATA TTTGGCTATT 360 
GCTGGCCTCC TATATCCGTC CTGAGGACAT TGTGAATTTT TCCCTGATTT GTAAGAATGC 420 
CTGGACTGTC ACTTGCACTG CTGCCTTTTG GACCAGGTTG OT^CCGAAGCA CTACACGCTG 480 
GATGCTTCCC TGCCTTTGCG TCTGCGACCA GAGTCAATGG AGAAGCTGCG CTGTCTCCGG 540 

GCTTGrrcrrGA tccgaictct gtaocatatc tatgagccat ttgctqctog aatctccaag 600 
AATCCAGCCA TTCCAGAAAG CACOCCCAGC ACATTAAAGA AITOCAAATG CTTACTTTTC 660 
TGGTGCAGAA AGArrGTTGG GAACAGACAG GAACCAATOT QC3GAATTCAA CTTCAAGTTC 720 
AAAAAACAGT CCCCTAGGTT AAAGAGCAAG TGTACAGGAG GATTGCAGCC TCCXCTTCAG 780 
TACGAAGATG TTCATACCAA TCCAGACXAG GACTQCTGCC TACTOCAGGT CACCAOXnC 840 
AATTTCATCT TTATTCCGAT TGTCATGGGA ATGATATTTA CTCTffrTTAC TATCAATGTC 900 
AGCAOGGACA TQCGGC3VTCA TCX3AGTGAGA CTQGTGTTCC AAGATTCCCC TGrrCCATQGT 960 
GGTCGGAAAC TGCGCAGTGA ACAGGGTCrTG CAAGTCATCC TGGACCCAGT GCACAGCGTT 1020 
CGGCTCTTTG ACTGGTGUCA TOCTCAGTAC CCATTCTCCC TGAfiAGOGTA GnTACTQCTT 1080 
CXXZATCCCTT GGGGC3CAGCC TOGAGTGTAG TCCATTAGTA ATCAGATTCC AGrPlTQGACA 1140 
GGGTGC3CTGG ATTGTATATC TCGTTAGTAA TGTACATGCT CTTCAQGnC TAGQGCTCCT 1200 
CJITAGC3QGAG GGAGAAATGT TCAATCAAGA GGGAAAACAA CTACTATGAT TTATAAACAT 1260 
ATTTTAATGT AAAAATTTGC A1TTAAAAGG AGTGGCCCTG TTTTCTGrPGT TAAAACCCCA 1320 
TTTGGrrGCTA TTGAGTTTGT TCTTTATTCT TTTATCCCAG TGAAAATTCT TGATCTTGCT 1380 
GTAGQC3AAAA ATTAAACTCT TTGAATCTCX: AAACAAGGAA GTTTCAGCAr TCCCTTATGG 1440 
ATCAGAGGAA CCTTAGAGGC CTCAAATTGr TGCTTCCAGT TTAGCTGCCC CTCAAATTCA 1500 
AGTGAATATT TTCCCTTCTC CCTTTACCCT TCTOCAGAAA TAAAGCAGGT GACAGGGrTT 1560 
CAGAATCTT 1569 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
CCCACGCGTC OGGAAGGATT GflCCAGTTAA CCAACATCTT AGOCCCCATG GCTGTTQGCC 60 
AGATTATGAC ATTTGGCTCC CCAGTCATCG GCTGTGGCTT TATTTCGGGA TGGAACTTGG 120 
TATCCATGTG CC3TGGAGTAC GTCCTGCTCT GGAAGGTTTA CCAGAAAACC OCAGCTCTAG 180 
CTCTGAAAGC TGGTCTTAAA GAAGAGGAAA CTGAATTQAA ACAGCTGAAT TTACACAAAG 240 
ATACTGAGCC AAAACCOCTG GfiGGGAACTC ATCTAATGGG TGTGAAAGAC TCTAACATCC 300 
ATGAGCTTGA ACATGAGCAA GAGCCTACTT GTGCCTOCCA GATQGCTGAG CCCTTCOGTA 360 
CCTTCOGAGA TGGATQQGTC TCCTACTACA ACCAGCCTGT GTTTCTQGCT GGCATQOGTC 420 

TTQCrrrcCT TTATA3X5ACT GTCCTOQQCT TTGACTGCAT CACCACAGGG TACGCCTACA 480 

CTCAGGGACr GAGTOGGTrTC CATOrrCAGT ATTTTGATQG GAGCATCAGC TATAACTGGA 540 

ATAATQGGAA CTGTAQCTTT TACTTGGCTA OGTCGAAAAT eSTOg rntA/r TCQQCAGGTC 600 

TGATCPCAQG ATTQGCftCAG CTTTCCTGTT TGATCTTGTG TGTGATCTCT GTATTCA3K3C 660 

CTOGAAGCCC CGTOGftClTC TCCGTTTCTC CTTTTGAAGA TATCOGftTCA AGGTTCATTC 720 

AAC3GAGAGTC AATTACftCCT ACCAAGATAC CTCAAATTAC AACTGAAATA TACATGrtCTA 780 

ATQGGrrCTAA TTCTC3CTAAT ATTGTCCCGG AGACAAGTPCC TGAATCTGTG CCCATAATCT 840 

CTGTCAGTCT GCTGrTTTGCA GQCGTCATTG CTQCTAGAAT OQGTCTTTGG TCCTTTGATT 900 

TAACTGTGAC ACAGTTQCTG CAAGAAAATG TAATTGAATC TGAAAGAGQC ATTATAAATG 960 

GTCTACAGAA CTCCATGAAC TATCTTCTTG ATCTTCTGCA TTTCATCATG CTTCATCCrGG 1020 

CTCCAAATCC TGAAGCTTTT QCXnTOCTCG TATTGATTTC ACSTCrcCTTT GrrOGCAATOG 1080 

GCCACATTAT GTATTTCCX3A TTTGCCCAAA ATACTCTGQG AAACAAGCTC TTTGCTTGCX5 1140 

GTCCTGATGC AAAAGAAGTT AGGAAGGAAA ATCAAGCAAA TACATCTGTT GrTTTGAGACA 1200 

GTTTAACTGT TGCTATCCTG TCACTAGATT ATATAGAGCA CATCTGCTTA TTTTGTACTG 1260 

CAGAATTCCA ATAAATQGCT GGGTGrmTG CTCTGTrTTT ACCACAGCTG TGCXTTTGAGA 1320 

ACTAAAAGCT GTTTAGGAAA CXTTAAGTCAG CAGAAATTAA CPGGATTAAT TTCCCTTATG 1380 

TTGAOGOCCA TGCmAAAAA ATTGGGAAAA GGAAAAACTC AGTTTTAAAT ACOGGAGACT 1440 

ATAATGGATA ACACTGRATT CCXXTTATTTC TCATGAGTAG ATACAATCTT ACGTAAAAGA 1500 

GTOGTTACJPC ACGTGAATTC AGrTTATCATT TCACAGATTC 1540 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
TACTTATGAG GTCAATTGGA AATAAGAACA CCATTTTACT GGGTCTAGGA TTTCAAATAT 60 

TACAGTTGGC ATGGTATGGC TTTGGTTCAG AACCTTGGAT GATGrrOGGCT GCTGGGGCAG 120 

TAGCAGCCAT GTCTAGCATC ACCTTTCCTG CTGTCAGTGC ACTTGTTTCA CGAACTGCTG 180 

ATGCTGATCA ACAGQGPGTC GTTCAAGGAA TGATAACfiGG AATTCGAGGA TTATGCAATG 240 

GTCTGGGACC GGCCCTCTAT QGATTCATTT TCTACATATT OCAItTTOGAA CTTAAAGAAC 300 

TGCCAATAAC AQGAACftGftC TTGQGAACAA ACACAAGCCC TCAQCACCAC TTTGAACfiGA 360 

ATTCCATCAT COCTOC3CCCT CXXnTCCTAT TTOGftGCCTG TTCftGTACTG CTGGCTCTGC 420 

TTGTTOOCTT GITTATTCOG GAACATACCA ATTTAAGCTT AAGC3TCCAGC AGTTC3GAGAA 480 

AGCACTGrPGG CflGTCACAQC CATCXTTCATA ATACACAAGC GCCAGGftSAG GCCAAAGAAC 540 

CTTTACTCCA GGACACAAAT GTGTGACGAC TGAAATCADG AAGMmTC TATCAGCAOC 600 

CAGGTCTTAG TTTTCACCTC TAGTTCTQGA TGTACAITCC ATTTCCATCC ACAGTGTACT 660 

TTAAGATTGT CTTAAGAAAT GTATCPGCAT GAACTCCGTG GGAACTAAAS GAACTTGGGAA 720 

CTTAGAACCA GACAGTTTTC CAAAGATGTT ACAATrTCnT TTGAAAAACC TTTTGITTAT 780 

TAGCACCAAT TTCTYGCCAC TAAGCTATTT GTTTTATTAT ACATCCTTTA ATTAAAAACT 840 

ATATATGTAA CTTCTTAGAT ATTAGCAAAT GTCTCTOCTA C3CATTTCCTT AAGCTrGTTGA 900 

GCTTTAACTC TATGCTGACT CAGTGAGACA CACTAGGTAG TATQGTTGTG GACCTATTTG 960 

TTTTAACATT GTAAAATTTT GAGTCAGATT TTAATATTGT AAAATCTTGG GTCAAATAAT 1020 

TCAAAGCCTT AATGCAGATG CACTAAAACA AAGAAATOGT AAATGAATTG TTTGCATITA 1080 

AAAAAAAAAA CTCTTAAGAA AACTGTACTA AATCTGAATC ATGTTTTGAG CTTGTTTQCA 1140 

GTACTTTTAA ACATTA3TCA CTACTGriTTT TGAAGTGAGA AAGTATCAGC CATTTAGCAT 1200 

TTAAGrrrOGG GTATTTAGAG CCTGTAATCT AAATGCTGGC TCAAATTTAT TCCXX^GCTA 1260 

CTTCTTATAC CACTATTCTT TTAATGTTTG CATAATCATA AGCACCTCAA CACTTGAAIA 1320 

CATAATCTAA AAAITATATA GTAAAGCTGG TAGCCTTGAA AA3CTCAGTG TGATATCTAT 1380 

TAOXSTAGATA AATATATATA GrPGGCCTTTC AGGACTGTPCA CAGTAACACT TTATTTACAG 1440 

AGCTAATGTT TGTCCTAAAT TTTCAGGACC CTAGAGGAGA GCTTTATACA ATTACCGATG 1500 

TQAATTTCTC TAAAGTCTAT ATTTITGrGT CCAGTTATAT 'CATTTAAAAA AGTGTTACTr 1560 

TGTAAAAATT GTATATAAAG AACTGTATAG TTTACACTGT TTTCATCTTG TGtGTrGGlTA 1620 

TTGCTTAATG CTTTTTAAAC rPQGAACACT CACTATGGTT AAATAAGGTC TTAAAAGAAA 1680 

TGTAAATATT YTGTTAATAA AGTTAAATAT TTTAATGAT 1719 






(2) INFORMATION FOR SEQ ID NO: 153: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 863 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

GGCACGftGGG AAGOCGQGAC GATOTCCGCA TGACAACCGA CGTTOGAGriT TQGAGGTOCT 60 

TQCCTTAGAG CAAGQGAAAC AGCTCTCATT CAAAQGAACT AGAAC3CCTCT OCCTCAGTGG 120 

TAGQGAGACA GCCAGGAGCG GnTTTCTGGG AACTGTGQGA TOmLXXmU QQQQCCCGAG 180 

AAAACAGAAG GAAGATQCTC CAGACCAGTA ACTACAGCCT QGTOCTCTCT CTGCAGTTCC 240 

TQCTCCTGTC CTATGACCTC TTPGTCAATT CCTTCTCAGA ACTQCTCCAA AAGACTOCTG 300 


TCATCCAGCT TGTGCTCITC ATCATCCAGG ATATTC3CAGT CCTCTTCAAC AStMCATCA 360 

TrrrocTCAT GnrcrrcAAC accttcctct tccaggctgg (xtogtcaac ctcctattcc 420 

ATAAGTTCAA AGGGACCATC ATCCTGACAG CrGTOTACTT TQCCCTCAGC ATCTCCCTTC 480 

ATGrrCTGGCSrr CATGAACTTA CGCTOGAAAA ACTOCAACAG CTPCATATGG ACAGAIOGAC 540 

TTCAAATQCT GTTTGTATTC CAGAGACTAG CAGCAGTGfTT GTACTGCTAC TTCTATAAAC 600 


GGACAGCCGT AAGACTAGGC GATOCTCACT TCTACCAGGA CTCTTTGrPGG CTGCGCAAGG 660 

AGTTCATGCA AGTTCGAAGG TGACCTCTTG TCACACTGAT GGATACTTTT CCTTCCTQGA 720 

TAC31AGGCX:A CAnroCTGC TTTQCAGGGG AGAGTTQGGC CCTA3GCATG GGGCAAAACA 780 

GGTQGGATTT TCCAAGQGAA GGGTTCAGAA TTAGGOTTGT TGTTTCAGCC ATTTCCAAGG 840 

AAGGGGAAGG GTTTCCCTNC CCT 863 






(2) INFORMATION FOR SEQ ID NO: 154: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AACAGCAAAA AAGAATGATT TCTTCTGAAA TTGTGGAACA TGAGGATTCA AGTTTTTATT 60 


TTGTTACTAG GTGCTGGAGG AACATCCCAG TTCACAAAGC CCCCATCTCT TCCTCTGGAG 120 

CCAGAGCCTG CGGTGGAATC AAGTCCAACT GAAACATCAG AACAAATAAG AGAGAAATAA 180 

GAATAGAATG AATGACOCCA AAATARGGTT TTCTTGGQCG AGGATGTGCT GGATTAGGAA 240 

AGGTGACATG ACACAGGCAG AGCAGAGTPGG CACCCACCAC AGAATACAGT GrPGTGTTATT 300 

ACGAGGAGCC AGCAGTTGAG CCTAAGGTCC TTCTACCTAC CTGGTAITGG CATTTGAGGT 360 

OQGAAACCCT CTACTOOCCC ATAAOCCAGG AAAAGTGAAA ASAGAACftCA GTTCCTTTAA 420 

GAACTOQCAG CAAGGCTTGA GQCCTTAror ATOTAGCTGA GTCAGCAAGG TACATGATQC 480 

TGTCTGCrrr CAAAAGCSACT TTTCTCTCCT ASCTGACTGA C l V C ri X X - lT AGTTCAAGGA 540 

ACAGCTGAGA CAGACCTCTG CTCAGTAGCT CTGTQATGAC AAAGCCTTGG TTTAACTGAG 600 

GTCATCCrCA GGTTGTGAGG TTTATTACTrC CXXAAGQCAA ACACAAATAT TAGATTAATA 660 

ATCCAACTTT AATAGTATAC ATTTAAAAGA AAAAAAACAA AAGCCCTQGA AC33TTGAGGC 720 

CAAGOCTGCT GAGTATTQCA GCTGCATTTG CCCAAAGGGA ATCCAGAACA AGTCCCTCCC 780 

TGTATTTTGT TCTTGAGAGG GGTCAGTCTA GAAGCTAGAT CCTATCAGGA TGAGGAGCAG 840 

CAGCCXAGGG CTTGTCTQGA TCAGCACCAA CGATTTTAAA GAAAAAAGGA AGAGTTTCTT 900 

AGATGAGTAA TTGTTAITGA AGATAGTCAG TGATAACCAC TCACCAGATG CTATCAATAC 960 


ACTATGICTC CITTTTAGAA TAAAGATTAC ATATCATCAT TCCTirOGGG AAAATTC3TTA 1020 

TTCAQGTATA AAAACAAGAG ATTATAATAA AAAANTAAAA GAACCCTAAA AAAAAAAAAC 1080 

CTCGTQCXX5A ATTCCCTGCA G 1101 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CAATTAACCC GTTTGAGGCC TAGGTTGTTT QGCAAGCXXX: NGGCCTAAAG TTTTAATTCG 60 

GCAGAGOCAA GGGCCTGAAA GGAAGGGAAA GGGGAGGGTA GCGGGAGGGT AQCAGGTGAG 120 

TTCCTAGGGC TOGAAGCnTT AGCAGCAGCC TGGTTGCAGTG CXXrTGTCATC AAGACAAACC 180 


CACGGTCCTC CTQGGTQCCT ACCAAGCTTG GTTTGTACAA AAGCAAGGTG GGAGTCTATT 240 

TTTGTACATG AGAOACATCA CACTTACCTG TGGGCCAGrTA TTGTGAAGTG AGTCTGAGTT 300 

GrrTACACTG ATGCCTTCCC TQCCCACCAC AAAITGrrGTA CATAGTCTTC AGAATGATAC 360 

CACCCCTTTC CCCAGCTCCC AACCAAGAGC TGGITCTAGG CCTGTGTTAT ATGTCATATT 420 

TAGCGTTTTT ATATATGACC TTTGATrrCT GrrTGrTTTGTA TITTAGCACA CTGTATGCAC 480 


CTTCATTTAA ATACATCTGT GTGCATACAG ATACGCATAT AICTGTGTOC GTATGCATAT 540 

ATCTCTCATC TGTAGrTTTCC AAGAGTTCAG CTGAAGCAGA TGGAGTCCTG CAGCCCAGGA 600 

GACACCXTOC ATCCCTGCTA ATAGTGrTTC OCACAAGTAT TAGTGAGTCT TCCTTATTAA 660 






TATTTTCATT TCAGAAGACT GAAGCAAAGC TGATAGTGTT TCCTCrTi'iV i ' TTGQCAGCTA 720 

AGTGAGGGTC TTC3QGATCAC TTQCTGTGTT CXrTCAAGCTG CACTTTOGGG CX:ATCTCTOC 780 

ACTATTAAGC COCCnTTTG CTTQGTOGTA CTCTGTCTGT GOCTOICTGT GTOTGTGATA 840 

GTCACTCTTG CATOGCTTCC A3CTCTGGTT TGTGGCAnT OOGC2ATAAGT GCTGAACCfiG 900 

AGCATTTCCA GrrTTGTTTCA QQCCPCGTTG CXJ^ATGATAG ATCACTCCTG TTGACCTOCTr 960 

ATGTCTGCrr GCTTGCTGCT TTTCCTIGCT TTCTCTTOGA AGftGGAAAQG ACTCTGCTTCA 1020 

GGCCCAGGCT GAGTCfiGATG AGCTGCAGCT GGCTCATGGC CTTCTTAGftG CAGAGAGAGG 1080 

AGTATGTCAT TTTACTAAGT TCCTAAACAA ACATTTATGC AGGCAACACT CCTTGCAGAT 1140 

CCAGAAACTG AQGCACAATA GGGTTAItSAC TTOCTCAAGA ATATGTAGCT GCTAGGGGGT 1200 

AAATCAAGGC ATCACAAITT CTGriTCAGCG GGCAGGAATA GGCTGTGAAT TGCTAGCACT 1260 

irrTTTTTAA GCAATTACTT TTTGACTTGT TCCTCTGAAA GTCCAAGAGG OGTACACXTTT 1320 

TOCCAAATGT AGACTAGAAT CTQCAGGATG CCACCTACTG TATAGTTCTG CTTTCCCAGA 1380 


GAGGAAGAAC TTTTAGAAAC CAAATGATCT TAATTGTTAT TQCCXIAOCCC TGC3CTTTTCC 1440 

GQGTAGAAAA TTCACAGTAG GAAIGATrGT TAAGAGAGAG TQCTTGGAAC CATGGGTTAA 1500 

CAQGAAAGGC TACCTAACTT CACATATCTC CAADCAGAGC AGCCACCAAG CATTACTTAG 1560 

CAGCAGGAAA ATGATTOTAT TTGAGTTCCT GTGTCTCCAA AACTGAGGCA CGATGTTCTr 1620 

TGAAAACATG CCACCTCAAG GCTQGGCGCG GTGQCTCACA CCTGTTAATC CXAGCACTTT 1680 


GGGAQGCCGA GGCGGGCXX3A TCACCGGAGT OQGGGAGTTT GAGACCAGCC TQGACCAACA 1740 

TGGGAGAAAC CCCATCTCTA CCTAAAAATA CAAAATTAGC CXSGGOGTOGT GGCATCOGCC 1800 

TATAATCTCA GCTACTTGGG AGGGYTGAGG CAGGRGAATT GCTTGAACCC RQGANOQOQG 1860 

AGGTTTQCGG TTGAGTTGAG GATOGTOCCA TTGCACTTCC GGGCCTTGGG GCAACAACAG 1920 

CAAAAAOTCC GTCTTCAAMW MKKXICGAAT TOGATATCAA GCTTATCGAT ACCGTCGACC 1980 

TCGAGGGGGG GCXXX3GTACC CAATTCGCCC TATAC2JC3ATC GTATTACAAT C 2031 






(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
CCTGCACCCT GAGCCXTPCA CCCCTCOGAG TTCCCCCCAG tf r m G Cr i tx r TTCGATTCCT 60 

TTTCTTGGTA TCAACGTTTG ATTGGAAGAA CAACCCCCTC TITOrCAACC TCAATAATGA 120 

GCTCACTCTG GftGGAGCAGC TOCSCSGCACAG CTCMCCXTrYA TGGrTCATTGT TACOCCCCAA 180 

GACCGCAAAA ACTCTGrTCTC GACACAGGAT GGAOCCTCAG OCCAGATCCT GCAGCAGCTT 240 

GTCGTCCTGG CAGCTGAAGC CCTGCCCATG TTAGAGAACSC AGCTCATQGA TCCCCGGGGA 300 


CXrrOGGGACA TCAGGACAGT GnTCCGGCCG CXXHTOGACA TTTACGAOGT GCTGATT0C5C 360 

CTGrTYTCCTC GCXJVTATCCC GOGGCACCGC AGQCTTGTGG ACTOGCCAGY TO LX:itX.TitJ 420 

TGCCGGGGCC TGCTCAGCCA GCCQQGGCCC TCATCCCTGA TCCCXX5TGCT GQGnNATQAT 480 

CCTNCTCAGC TCTATCTGAC GCAGCTCAGG GAGGCCTTTG QQGATCTC3C3C CCTTTTCTTC 540 

TATGACCAGC ATQGTGGAGA GGTGATTQGT GTCCTCTQGA AGCCCAOCAG CTTCCAG0CX3 600 


CAGCCCTTCA AGGCXTTCCAG CACAAAGGGG CGCATGGTGA TCTCTCGAGG TQGQGAGCTA 660 

GTAATGGTGC CCAATGTTGA AGCAATCCTG GAGGACTTTG CTGTOCTQGG TGAAGGCCTG 720 

CTTGCAGACTG TGGAGGCCCG AAGTQAGAGG TGGACTGTGT GATOCCAGCT CTQGAGCAftG 780 

CTGrrASACGG ACAGCAGGAC ATTGGACCTC TAGAGCAAGA TCTCAGTAGG ATGACCICCA 840 

CCCTCCrrCG ACATGAATCC TCCATGGAGG GCCTGCTGGC TGAACATGCT GAATCATCTC 900 


CAACAAAACX: CAGCCCCAAC TTTCTCTCTG ATGCTCC7W3C ATTGGGGCAG GGC3CATQGTG 960 

GCCCATGTAG TCTCCTGGGC CTCACCATCC CAGAAGAGGA GrTGGGAGCXIA GCTCAGAGAA 1020 

GGAACTGAAC CXMGAGATC CATCCACCTA TTAGCCCTGG GCCTQGACCT CCCTGOGATT 1080 

TCCCACTCCT TTCTTAGTCT TCTTCCAGAA ACAGAGAAGG GGATGTGTGC CTGGGAGAGG 1140 

CTCTGrrCTCC TTCCTGCTQC CAGGACCTGT GCCTAGACTT AGCATQCXOT TCACTGCAGT 1200 


GTCAGGCXnT TAGATQGGAC CXZAGCGAAAA TGTOGCXXnT CTGAGTCACA TCACCGACAC 1260 

TGAGCAGTGG AAAGQGGCTA TATGrTGTATG AATAGACCAC ATTGAAGGAG CACAATQCXX: 1320 

TCCTGTCTTG ATGCCACTTC CXIAGGGTGGA GACAGTTGGAA AAGAACCGAG GACAGGAAAG 1380 

GATTQGGTAG GTGAAGGQGT CAGGGGACTG GTAGTCACCC AATCTTGGAG AGGTGCAAAA 1440 

AGCACTGGGG GCTACCCGTT AGCTGCATCT GCCCTGGCTG TTTQCCXOTT CATGTCACAA 1500 


ACTGCXrACTA CTATffTACCT GCAGTPGGGGT TGCAGAGATG GGGGAGACTC AAGTCTTACT 1560 

COCCAGGAGC TCCCAGGGCC CAAGGAGGAG AATGCTGCCT CCTTTCAGTC TGCTTCTACAC 1620 

CCACTTTCTG GTAGCX^CTC TGCTTCCTGT AATTCTQGCT GTTTTTCCAG ACTCAGCTCA 1680 

AATAGrrGCCC CTCCTTAAGC CCATCCCTCG CCCCCAGCCT GAGGTGATCT TTCCXnCCTC 1740 

TGAACTATTA GAGCAGTTAC TGnCTGTTCA GTTC5GTTTX3G CAGGCACACA CAGTGGCATA 1800 

AATTCTATTG TTITGAACTC TGATTTAAAA TTAAATTOCA GCTOGGOGTC GTGGCTCATG 1860 

CTTGTAATCC CAACACTTAG GGAGlMAGCai GAATCACTTG ASCYCAGGAG TYCTAGACCA 1920 

ATCTQGGCAA MAGAGAGACC CCATCTCTTT TAAATAAAAA GTTAAATTGC TTAAAAAAAA 1980 

A 1981 


(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GAATTCGGCA CGAGCGOGGC CATGQCGCTC CTGCTTTCGG TQCTGOGTGT ACTGCTGGGC 60 

GGCTTCTTCG CGCTCGTOQG GTTGGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTOG 120 

GAGCQGATGA ATGCCCTGTT CC?rGCAC7rTT GCTGAGGrTGT TCCOGCTGAA GGTATTTGGC 180 

TACCAGCCAG ATCCCCTGAA CTACCAAATA GCTGrTGQGCT TTCTGGAACT GCTGGCTGGG 240 

TTGCTGCTGG TCATGGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTGATTCTG 300 

CTCATGATGG GGGCTATCTT CACCTTQGCA GCTCTGAAAG AGTCACTAAG CACCTGTArC 360 

CCAGCCATTG TCTC3CCTQGG GTrCCTQCTG CTGCTGAATG TCGGCCAGCT CTTAGCCCAG 420 

ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTQG 480 

AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540 

ACAGAGTCTA TCATCTTGTT ACCAGTATAA TATCCAGGGT CAGCCAGTGT TQAAAGAGAC 600 

ATTTTGTCTA CCTGGCACTG CTTTCTCTTT TTAGCTTTAC TACTCTTTTG TGAGGAGTAC 660 

ATGTTATGCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAGC AGAAAA6AAA 720 

TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780 

TACCTCTQAA CiTi'lTlCTG TGCCTTTAAA CAGATATATA TTITTTTTWA ATCAAAATAA 840 

AACCATATAT CCTATTTTAT TTCCTCCrTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 900 

AAAAAAAAAA CTCGA 915 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2117 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

AGftGC3GAAGC GAGQGTGGCG CGGGTCOGGG CATGAAGCTG GGCOCSGQCOG TC3CTQGGCCT 60 

GCTGCTGCTG GOGOOGTCCG TOGTCCAGGC GGTQGAGCCC ATCAGCCTOG GftCTGGCCCT 120 

GCXXX3GCC5TC CTCACCX3GCT ACATCTACCC GCC?rCTCTAC TQCCTCTTCG COGftfTPQCTG 180 

COGGCAGAAG CGC3AGCCTTA GOCGGGAGGC ACTGCAGAAG GATCTGGACG ACAACCTCTT 240 

TGGACAGCAT CTTGCAAAGA AAATCATCTT AAATGCOGTG TTTGGTrPCA TAAACAACCC 300 

AAAGCCCAAG AAACCTCTCA CGCTCTOCCT GCAOGGGTCG ACAGQCACOG GCAAAAATTT 360 

CXSrrCflGCAAG ATCATCGCAG AGAATATTTA CGAGQGTOGT CTGAACAGTG ACTATCTCCA 420 

ccrcrmCTG GCCACATTGC ACTTTCCACA TGCTTCAAAC ATCACCTTCT ACAACXSATCA 480 

GTTACAGTTG TGGATTCGAG GCAACCTTGAG TQCCTOPGCG ADGTCCATCT TCATATTTGA 540 

TGAAATQGAT AAGATCCATG CAGGCCTCAT AGATQCCATC AAQCCTTTCC TCGACTATTA 600 

TGACCTQGTG GATGGGGTCT CCTACCAGAA AGCCATGTTC ATATTTCTCA GdAATGCTGG 660 

AGCAGAAAGG ATCACAGATG TGGCTTTGGA TTTCTGGAGG AGTPQGAAAGC AGAQGGAAGA 720 

CATCAAQCTC AAAGACATTG AACACGCGTT GTCICTGrrOG GTTTTCAATA ACAAGAACAG 780 

TGGCTTCTQG CACAGCAGCT TAATTCACCG GAACCTCATT GATTATTTTG TTCCXTTTCCT 840 

CCCCCTGGAA TACAAACACC TAAAAATGTG TATOCGAGTG GAAAPGCAGT CCCGAGGCTA 900 

TGAAATTGAT GAAGACATTG TAAtSCAGAGT GGCTGAGGAG ATCSACATTTT TCCCCAAAGA 960 

GGASAGAGTT TTCTCAGATA AAC3GCTQCAA AAOGGrTGrTTC ACCAAGriTAG ATTATTACTA 1020 

CX3ATGATTGA CAGTCATGAT TGGCAG(XGG AGTTCACTGCC TGGAGTTQGA AAAGAAACAA 1080 

CACTCAGTCC TTCCACACTT CCACCCXXaG CTCCmCCC TGGAAGfiGGA ATCCAGTGAA 1140 

TGTTCCTGTT TGATGTCACA GGAATTCTCC CTQGCATTGT TTOCACCXXX: TGGTQCXrTOC 1200 

AGGCCACCCA GGGACCACGG G0GAQC3ACCT GAAGCCTCCC GAACACGCAC AGAAC3GAAGG 1260 

AGCCAGCTCC CAGCCCACTC ATCGCAGGGC TCATGATTTT TTACAAATTA TGTTTTAATT 1320 

CCAAGTGnrPr CTGTTTCAAG GAAGGATGAA TAAGrmTAT TGAAAATC3TG GTAACTTTAT 1380 

TTAAAATGAT TTTTAACATT ATGAGAGACT GCTCAGATTC TAAC3TTGTTG GGXnTGTCTG 1440 

TGTGTmTr TTTAAGTTCT CATCATTATT ACATAGACTG TGATGTATCT TTACTQGAAA 1500 

TGAGCCCAAG CACACATQCA TGGCATTTGT TCCACAGGAG GGCATOXTG GC3GATGTQGC 1560 

TQGAGCATGA GOCAGCTCTG TCCCAGGATG GTCCCAGCGG ATGCTGCCAG GGGCAKTCAA 1620 

GTGTTTAGGT GAAGGACAAG TAGGTAAGAG GACGCCTTCA GQCACCACAG ATAAGCCTGA 1680 

AACAGCCTCT CCAAGCSGrTTT TCACCTTAGC AACAATQGGA GCTGTGQGAG TGATTTTGOC 1740 

CACACTGTCA ACATTTCTTA GAACCAGTCT TTTGAAAGAA AAGTATTTCC AACTTGrTCAC 1800 

TTQCCAGrrCA CTCCGTirrG CAAAAGGIGG OCCTTCACTG TCCATTCCAA AS'AGCCCACA 1860 

CGTGCTCTCT GCrGGATTCT AAAITATGTG AATTTTGCXIA TATTAAATCT TCCTCATTTA 1920 

TACTATTATT TGrTTACGnTC AATCAGAATC CCCGAAACCT CTTATAAAGC TTAGCTGCCC 1980 

CTTCTGAGGA TGCTGAGAAC GGTGrTCTTTC TTTATAAATG CAAAJTOGCTA CXXnTTTACA 2040 

ATAAAATTTT GCATGTGCAA AAAAAAAAAA ANAAAAAAAA AAAATCCCGG GGGGQGGCCG 2100 

GTAAOCAATT TGNCCCC 2117 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

TGTTCCTTAA TCCCTTTTCT AAAAAGGGGG GAAAATCCGG ATGGATTTTA GGGATTGGTC 60 

TGGTGTCAGC TGTGTTTTAT TGCACAiXTA AATCCTGATT ATAGGCTTTT CATTTCTCOG 120 

CAAAGCCTTT ATTTTGGCAG TTAAGCCAAA TGTGTTTTCC AGAAAGTTAG TTATTTTCTC 180 

CTCTTTCTTT CCTTTCTrrC CTCCCmTT CCCGTCTGAC CCCAAAOGTT ATTGTCCAAA 240 

CATGACTGGA CAGCAGCTTT TGTTTCTTGA OCCTGTAATA TGACAGTCTG CTAATATTGA 300 

CAGAAGCyPGC AGTTTTTGGG ITATAGTCGT GATTTTOGCT AATCAATCAT ATTAGCAGGA 360 

AAAAAAAKGA CTTGTTTCTG TTGTACTTGA GnTCTTAAGAA AAAGTGGCCC ATAGTTTAGT 420 

GGACAATTTC CAAAGGCTTT AGTACCACCT GTATTTCAAA ATGGGGGACC CAAACTCCOG 480 

GAAGAAACAA GCTCTGAACA GACTACGTGC TCAGCTTAGA AAGAAAAAAG AATCTCTAGC 540 

TGACCAGTTT GACTTCAAGA TGTATATTGC CTTTGTATTC AAGGAGAAGA AGAAAAAGTC 600 

AGCACTTTTT GAAGrTGTCTG AGGTTATACC AGTCATGACA AATAATTATG AAGAAAATAT 660 

CCTGAAAGGT GTGCGAGATT CCAGCTATTC CTTGGAAAGT TCCCTAGAGC TTTTACAGAA 720 

GGATGTOGTA CAGCTCCATG CTCCTCGATA TCAGTCTATG AGAAGQGATG TAA3TGGCTG 780 

TACTCAGGAG ATOGATTTCA TTCTTrGGCC TCGGAATGAT ATTGAAAAAA TCGTCTGTCT 840 

CCTGTTTTCT AGGTGGAAAG AATCTGATGA GCCTTTTAGG CCTGTTCAGG CAAATTTGAG 900 
TTTCAirCATG GTCACTATGA AAAACAGITT CTGCATGTAC TGAGCCGCAA GGACAAGACT 960 

GGAATCGTTG TCAACAATCC TAACCAGTCA GTGTTTCTCT TCATTGACAG ACAGCACTTG 1020 

CAGACrCCAA AAAACAAAGC TACAATCTTC AAGTTATGCA GCATCTGCCT CTACXTTGOCA 1080 

CAGGAACAGC TCACCCACTG GQGCAGrTTGG CACCATAGAG GRlCftCCTTCC GTCCTTATAT 1140 

GCCAGAGTAG AGTACTGACC AGCAAAATOG AGAAGATCAG AGAATGCAGC AGCAGTTTTT 1200 

TTTCTrGTTT TCTTACCACT TTATTCTTTC AGAGTTTAAA GAAAATGGAC TCATGCACAG 1260 

AACACTATGC ATTTTGAAAC TTGTrCATCC TGGATTTTTT TAAATCATTT TTATCrCAGA 1320 

ACTTAAACAA AAATTAGATG TCGTGCACGG ACTOTGTGAA AGAAGATGCT TTGCATATTT 1380 

GCTGCACTGC ATCAGTATCT TACTAAAAAT GTGAAATGAA AGGACTATTG TACACTGAAA 1440 

TGCTTAAATG TATCTGAAAG CACAAGGTGA TACTCATTTT TATGGTCTTC CCATTTCTGC 1500 

raTTTTTTGC CTCTTTGACA TCTOTCATCA GTATTTAGAG GGTGAGAAGT GAATGTAACA 1560 

GC3TATAAATA ACATTTTTAA AAACAATAAC TTTGCTATAA TCACAGTTGT TCCAGAGCAC 1620 

TGTCAGATAC ATTCTAATGA CCAGAACTGG TTTAAAAAAA GAAAATACAA CXATGGGAAA 1680 

GAAATCTTAA ATGAAAAACG CATCTCATTG TAGGCAaTTT TGCCTCATAT TTTACTQGGC 1740 

CATGTTTCTT TOCTOGnACT CArGTATTTT TTTTTTCCAG ATCTCTTTCC OCAAGTTGCT 1800 

ATTGTAAGAG TATTCTGCTG CGTGTGGATG CAGTTATACA CATTAAAGCA GATCTQQAGT 1860 

CTGAASXAGC TATAAAQCAG CTATAAAACA GAAATACATG CATAGCTQCA GAAACCATGA 1920 

TAOGTAGAGG ACrTTTCTTT TGGTrTTGTT TICTTTTGIT TTGTTTTGTT TTTGGrTTTA 1980 

CAGAGAAGAG ATTTTTATTA CAAAGAAAAA AATTCCAGTG AATTGrPQCAG AAATGCTGGT 2040 

TTTTACACCA TCCTAAA3AA AAACTTTACA AGGGrrcrrrTT ggagtagaaa aaaggttata 2100 

AAGTTQGAAT CITAAATTGT AAAATTAACC ATTGAGTOTC AAAGTTCTAA AAGCAGAACT 2160 

CATTTPGrrGC AATGAACATA AGGAAAGACT ACTGTATAGG ' mTriTlTr TTCTCCTTTT 2220 

AAATGAAGAA AAGCTTTGCT TAAGQGTTGC ATACTTTTAT TGGAGTAAAT CTGAATGATC 2280 

CTACTCCTTT GGAGTAAAAC TAGTOCTTAC CAGTTTCCAA TTGTATTTAG CTiXJ'iWriXJ 2340 

GAAOTTGAAA AAAAAAGAAA AAAAGAAAAA GAAAACCTAA ATAAAATAGG TGAAA 2395 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

CCCOQGATAC CGCCTGACGT AGTOCXAATC ACACXTTCTCG CGrTCTCGGCG CCTOQGAGGC 60 

TAATGAGGAC GCCTOGCGAA AOGCAGTAAC GGATTTOOGG GTOGMXTTC GCTTTAOGGC 120 

TCCTTGACTrTC TTCOSCCCAA CCCAGAGGAA GCGGGAGAGC AGnTTAOGAC AGCGCCGGflC 180 

GTCTTTACGG CQGCGCXCGC TQCGCGCGCA UVmiViV r irrCCTOGTr TCTCAAGACT 240 

GCroCTOCTA AOSaSGTCCC CGGCACQCAC CATCTGTroC CATCXXX3QCC GGCXX3AGOCA 300 

TTGCfiGATTT TGGAAGATGG CAAAGTTCAT GACACXXX5TC ATCCAGGACA ACCCCTCAGG 360 

CTQGQGrrCCC TGTOCGGTTC CCGAGCAGTT TCGGGATATG OCCTACCAGC CGTITCAGCAA 420 

AGGAGATCGG CTAGGAAAGG TTGCAGACTG GACAGGflGCC ACATAOCAAG ATAAGAGGTA 480 


CACAAATAAG TACTCCTCTC AGTTTGGTOG TGGAAGTCAA TATOCTTATT TCCATGAGGA 540 

GGATGAAAGT AGCTTCCAGC TGC3TQGATAC AGCGCGCACA CAGAAGACGG CCTACCAGCG 600 

GAATOQAATG AGATITGCCC AGAGGAACCT CCGCAGAGAC AAAGATOGTC GGAACATCTT 660 

GCAGITCAAC CTGCAGEATCC TC3CCTAAGAG TGCCAAACAG AAAGAGAGAG AACGCATTCG 720 

ACTGCAGAAA AAGTTCCAGA AACAATITOG GGTTAGGCAG AAATGQGATC AGAAATCACA 780 


GAAACCXXXA GACTCnCAG TTGAAGTTCG TAGTCATTGG GAAGTGAAAG AGGAAATGGA 840 

TTTTCCTCAG TTGATGAAGA TQCGCTACTT GGAAGTATCA GAGCCACAGG ACATTGAGTC 900 

TTGTGGGGCX: CEAGAATACT ACGACAAAGC CTITCAOCGC ATCACCACGA GC5AGTCAGAA 960 

GCCACTQCGG ASAINCAAGC GCATCTTCCA CACTGTCACC ACCACAGACX5 ACCCTCrCAT 1020 

CXX3CAAGCTG GCAAAAACTC AGC3GGAATGT GTTTGCCACT GATGCCATCC TGCCCAOJCT 1080 


GATGAGCTGT A0CCX3CTCAG TGTATTCCTG GGATATTGTC GTCCAGAGAG TTOGGTrOCAA 1140 

ACTCTTCTTT GACAAGAGAG ACAACTCTGA CTTTGACCTC CTGACAGrTGA GTGAGACTGC 1200 

CAATGAGCCC (XTCAAGATG AAGCTAATTC CTTCAATTCA CCCCXX:AACC TGGCCATQGA 1260 

GGCAACCTAC ATCAACCACA ATTTCTCCCA GCACTTGCTTG AGAATGGGGA AGGAAAGATA 1320 

CAACTTOCCC AACXXAAACC CGTTTGTGGA QGAOGACATG GATAAGAATG AAATCGCCTC 1380 


TCTrTGCXSTAC CGTTACCXXaV GTGGNAAGCT TGGAGATGAT ATTGACCTTA TTGrCXXTTTG 1440 

TGAGCACGAT GGCCHCATGA CTGGAGCCAA CXSGGGAAGTC TCCTPCATCA ACATCAAGAC 1500 

ACTCAATGAG TGGGATTCCA GGCACTGTAA TGGCX3TTGAC TQGCGTCAGA AGCTGGACTC 1560 

TCAGCGAGQG GCTGTrCATTG CXZAOQGAGCT GAAGAACAAC AQCTACAAGT TGGCCCGGTG 1620 

GACCTGCTGrr GCTTTGCTQG CTGGATCTGA GTACCTCAAG CTTGGTTATG TGrrCTCGGTA 1680 

CCACGTCAAA GACTCCTCAC GCCACGTCAT CXTTAGGCACC CAGCAGmCA AGCCTAATGA 1740

GTTTOCCAGC CAGATCAACC TGAGCGTOGA GAATGCCTGG GGCATTTTAC GCTGCGTCAT 1800

TGACATCTGC ATOAAGCTQG AGGAGGCXAA ATADCTCATC CTCAAGGACC CCAACAAGCA 1860

GGTCATCCGT CTTCTACAGCC TOCCTGATOG CAOCTTCAGC TCTQATGAAG ATGAGGflGGA 1920

AGAGG^^GGAG GAAGAAGAGG AAGAAGAAGA GGAAGAAACT TAAACCAGTG A3CTGGAGCT 1980

GGAGTTTGrrC CTTCCACCGA GACTACGAGG GCCTTTGATG CTTAGTGGAA TGTOPGTCTA 2040

AOTTGCTCTC TGACATTTAG CflGATGAAAT AAAATATATA TCTGrrTACT CTTAAAAAAA 2100

AAAAAAAAAA AAAAAAAAAN 2120



(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

GGAAGCTGAA GTCCTTCCAG ACCAQGGACA ACCAGGGCAT TCTCTATGAA GCTGCACCCA 60

CCTCCAOCCT CACCTGTRAC TCAGGACCAC AGAAGCAAAA GTTCTCACTC AAACTGGATG 120 

CCAAGGATGG GCGCTTGTTC AATGAGCA6A ACTTCTTCCA GCGQGCOGCC AAGCCTCTGC 180 

AAGTCAACAA GTQGAAGAAG CTGTACTCGA CCCCACTGCT GGCCATCCCT ACCTGCATGG 240 

GTTTCGGTGT TCACCAGQAC AAATACAGGT TCTTGGTGTT ACCCAGCCTG GGGAGGM3CC 300 

TTCAGTCGGC CCTGGATGTC AGCCCAAAGC ATGTGCTGrTG CAGAGAGGTC TGTGCTGCAG 360

GTGGCCTGCC GGCTGCTGGA TGCOCTGGAG TTCCTCCATG AGAATGACTA TGTTCATGGA 420 

AATGTGACAG CTGAAAATAT CTTTGTGGAT CCAGAGGACC AGAGTCAGGT GACTTTGGCA 480 

GGCTATGGCT TCGCNTTCCG CTATTGCCCA AGTGGCAAAC ACGTGGCCTA CGTGGAAGGC 540 

AGCAGGAGCC CTCACGAGGG GGACCTTGAG TTCATTAGCA TQGACCTGCA CAAGGGATGC 600 

GGGCCCTCCC GCCGCRGCGA CCTCCAGAGC CPGGGCTACT GCATGCTGAA GTQGCTCTAC 660

GGGTTTCTGC CATGGACAAA TTGCCTTCCC AAMAMTGAGG ACATCATGAA GCAAAAACAG 720 

AAGTTTGTTG ATAAGCCGGG GCCCTTOJrG GGACCCTGCG GTCACTGGAT CAGGCCCTCA 780 

GAGACCCTGC AGAAGTACCT 6AAGGTGGTG ATGGCCCTCA CGTATGAGGA GAAGCCGCCC 840 

TACGOCATGC TGAGGAACAA CCTAGAAGCT TTGCTGCAGG ATCTGCGTGT GTCTCCATAT 900 

(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

GGCAOGAGAT GAGGGGCACC CAGrPGCTTCT AGGGCAGGCT GGGTQGTCCT CCCCTAGGTEA 60 

TCAGCCTCTC TTACTGTACT CTCOGOGAAT GnTAACCTTT CTATTTTCAG CCTCTOtXAC 120

CTGTCTAGGC AAGCTQGCTT CCCCATTQGC OCCTGTGGGT CCACAGCAGC GTQQCTC3CCC 180 

COCAGGGCCA COGCTTCTTT CTPGATOCTC TTTCCTTAAC AGTGACTTGG GCTTGACTCT 240 

GGCAAGGAAC CTTGCTTTTA GCTTCACCAC CAAGGAGAGA GGTPQACATG ACCTCCCCQC 300 

CCCCTCACCA AGGCTGGGAA CAGAGGGGAT GTGGTGAGAG CCAGG?rTCCT CTQGCCCTCT 360 

CCAQGGTGTT TTCCACTAGT CACTACTGTC TrCTCCTTGT AGCTAATCAA TCAATATTCT 420

TCXXTITOCCT GTQGGCAGTG GAGAGGCTGC TGGGTGTACG CTGCACCTQC CCACTGAGnT 480 

GGGGAAAGAG GATAATCAGT GAGCACTGTT CTGCTCAGAG CTCCTGATCT ACCCCACCCC 540 

CTAGGATCCA GGACTGGGTC AAAGCTQCAT GAAACCAGGC CCTGGCAGCA AACCTGQGAA 600 

TGGCTGGAGG TGQGAGAGAA CCTGAACTTC TCTTTCCCTC TCCCTCCTCC AACATTACTG 660 

GAACTCTATC CTGrTTAGGAT CTTCPGAGCT TGTTTCCCTG CTGOGTGOGA CAGAGGACAA 720

AGGAGAAGGG AGGGTCTAGA AGAGGCAGCC Ci'lCITltJit: CTCTGGGGTA AATGAGCTTG 780 

AOCTAGAGTA AATGGAGAGA CCAAAAGCCT CTGATTTTTA ATTTCCATAA AATGTTAGAA 840

GTATATATAT ACATATATAT ATTTCTTTAA ATTTTTGAGT CTTTGATATG TCTAAAAATC 900 

CATTCCCTCT GCCCTGAAGC CTGAGTGAGA CACATGAAGA AAACTGTGTT TCA1TTAAAG 960 

ATGTTAATTA AATGATTGAA ACTTGAAAAA AAAAAAAAAA AAA 1003



(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2196 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:



AAGAAGOGGC ACAOGGATGT GCAGTTCTAC ACAGAAGTGG GAGAGATAAC CACGGACTTG 60

QGGAAACATC AGCATATC3CA TGACCGAGAT GACCTCTATG CTCAGCAGAT GGAACGAGAA 120

ATGAGQCACA AACTGAAAAC AGCCTTEAAA AATTTCATTG AGAAAGTAGA QGCTCTAACT 180 

AAGGAGGAAC TGGAATTrGA AGTGCCmT AQGGACTTGG GATTTAACGG AGCTCOCTAT 240 

AGGAGTACCT GOCTCCTTCA GCCCACTAGT AGTGCCXTGG TAAATGCTAC QGAATQGCXIA 300 

CXTTTTGTGG TGACATTCGA TGAGGTAGAG CTGATCCACT TTRAGCXXSGT CCAGTITCAC 360

CTGAAGAACT TTGATATGGT AATCGTCTAC AAGGACTACA GCAAGAAAGT GACCATGATC 420 

AAOGCCATTC CTGTAGCXrrC TCTTGACCCC ATCAAGGAAT QGTTGAATTC CTGOGACCTC 480 

AAATACACAG AAGGAGTACA GTCCCTCAAC TQGACTAAAA TCATGAAGAC CATICTTGAT 540 

GACCCTGAGG GCTTTCTTCGA ACAAGGTC3GC TC3GTCTTTCC TC3GAGCCTC3A GGGTGAGGGG 600 

AGTCATQCTG AAGAAQGGGA TTCAGAGTCT GAAATTGAAG ATGAGACTTT TAATCCTTCA 660

GAAGATCACT ATGAAGAGGA AGAGGAGGAC AGTGATCAAG ATTATTCATC AGAAGCAGAA 720 

GAGTCAGACT ATTCTAAGGA GTCATTQQCTr AGTGAAGAAG AGACTOGAAA GGA3TQQGAT 780 

GAACTGGAGG AAGAAGCXXX; AAAAGCX3GAC CGAGAAAGTC GrTTACGAGGA AGAAGAAGAA 840 

CAAAGTOGAA CTATGAGCCG GAAGAGGAAG GCATCTGrTGC ACAGTTCGGG OCGTOGCTCT 900 

AACCGTOGTIT CCAGACACAG CTCTQCACCC CCCAAGAAAA AGAGGAAGTA ACTTCTGAAC 960

TITOGCCCrG AGCTCCATTC TTCCTCCAGC CAACCCCTGA AAATTTTACA TGACATA6AA 1020 

ACTGTATTTT TCCTTTCGTT TTCATTTGAA GTTTTGCXyVT TTGTGTTTAT GQGTTTAGGG 1080 

GGCCAfTTTGT GTGGACCAAT CTACICGGGG AATTCCAGGC (XACCAGGAC ACGTGCCAAT 1140 

GQCaXATTC AGATGGCAAG GGAGGAGGTG TTCTTGAAGA CAGGAGGAGG CTCOCGCTGrT 1200

TAATAAATAT TGTTTCATTC TTCTCTCTrC CTGTCACCTT CTGCCAAGAC ATTGATQGCT 1260

TCrPGACATCT TAaTTQGTGT CTCAAAGCTG TATTTCCAAG ACAGTOGTAC AAGGTGACCC 1320 

TTAATTAOCC GTATCATGGT TCTTGACCAG CACATTCAAT CCTCCAACCT ACCCTACTGC 1380 

CATGACCTTC OGCACATCTC TAAGTTTTAT CTTTGCAATA CTCAAGGrTTC TCGGAAATTT 1440 

GCTAATQGTT GTGATAAACC ATACAGCTTG AGCCAGTGAG GCAGATTGGG CiWlXjCClT 1500 

CGTCTGAGTT TTCCTGCTTT CCTQCCTCGT GCAGATTCTG AGGTATATCT GCrGCCTTGG 1560

AAGACATAAG AAGCAGTGAT ACTCCCTGGC TCGGTrAITT TCTCCATACA ATGCACACAT 1620 

GGTACAATCA TAGAAGGCAA AATTQCX:ACT G ^ iirrXCTm ' TTTTCTCATA TATCTAAGGA 1680 

AGATATATCA GGTTGTGCCT CATGTAOCGC TTCTAGTCAA ATOEAGAGGA AGGCTCAAAG 1740 

GAGTCAACAT TTAGATCTGG AAGGGACAAG OCATGCCTTG GGCCTAGAAT ACCCTGATCA 1800 

GAAAAGAGAA GAGGAAGGGA GGCX:ATATCT ACAACANCAN CCTCTCGGCA CTGCTGCTCC 1860

TTATTTTAAC TTTOTCTTGC ATPGrrCCTOT ATrTATCACA GTTTCTGTTG AACAGCTTTT 1920

CAAGTATTTG QGGAGTTTAT CTTGCCATCC TCCCCTTCTG CTICPCTOCA CCCACCTGnC 1980 

CCACTGCAGT TCCTTCOGrTG CTCTGTGACT TTAAGAGAAG AAGQQGGGAG GGGTCCCGGA 2040 

TTTTATGTTT GTl ' lVrmT TCTCCTrAGC AGTAGGACTT GATATTTTCA ATTTTQGAAG 2100 

AACTAAAAGA TGAATAAACT GUCjrinTrri ' TGTTGTTTGT TlTrGTAAAA AAAAAAAAAA 2160

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 2196 



(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1945 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

GCACAGAGTC GGC5CGGACGG ACAGGGfiGAG GAGGAGAGGG GCTCTGCGCG OQGCCGCTAC 60 

CCAGAAGCCA GCGGACGGCA GCAOGGAGTG GGCTGTCCCC GAGCCCAGCC OCGAGCGAGC 120 

CCCCCCCCCG CCCCCGMAGG ACGOGCCTYC CAGCCAGCCC GACTYCTAGG AGGAGGGGAG 180 

GCGQGAAAGC AGCTCAAGCC TCACCCACCG CCCTGCCCCC AGCCCCGCCA CTCCCAGGCT 240 

CCTCGGGACT CGGCGGGTCC TCCTQGGAGT CTCGGAQGGG ACCGGCTGTG GAGACGCCAT 300

GGAGTPGGTG CTGGTCTTCC TCTGCAGCCT GCTGGCCCCC ATGGTCCTGG CCAGTGCAGC 360 

TGAAAAG6AG AAGGAAATGG ACCCTTTTCA TTATGATTAC CAGACCCTGA GGATTGGGQG 420 

ACTGGTGTTC GCrGTQGTCC TCTTCTOGGT TGGGATCCTC CTTATCCTAA GTCGCAGGTG 480 

CAAGrPGCAGT TTCAATCAGA AGCCCCQGGC CCCAGGAGAT GAGGAAGCCC AGGTQGAGAA 540 

CCTCATCACC GCCAATGCAA CAGAGCCCCA QAAAGCAGAG AACTGAAGTG CAGCCATCAG 600

GTOGAAGCCT CTGGAACCTG AGGCGGCTGC TTGAACCTTT GGATQCAAAT GTCGATGdT 660 

AAGAAAACCG GCCACTTCAG CAACAGCCCT TTCCCCAGGA GAAGCCAAGA ACTTGTGTGT 720 

CCCCCAOCCT ATCCCCTCTA ACACCATTCC TCCACCTGAT GATGCAACTA ACACTTGCCr 780 

CCCCACTGCA GCCTQCGGTC CTGCCCACCT OCCGTGATGT GTGTGTGTGT GTGTGTGTGT 840 

GTGACTGTGT GTGTTTQCTA ACTGTGGTCT TTGTOGCTAC TTGTTTGTGG ATGGTATTGT 900

GTinViTAGr GAACTGTGGA CTCGCTTTCC CAGGCAGGGG CTGAGCCACA TGGCCATCTG 960 

CTCCTCCCTG CCCCCGTQGC CCTCCATCAC CTTCTGCTCC TAGGAGGCTG CTTGITGCCC 1020 

GfiGACCAGCC CCCTCCCCTG ATTTAGGGAT GCGTAQGCSTA AGAGCAOGGG CAGTOGTCTT 1080

CAGrrCGTCTT GGGACCTGGG AflGGnTTGCA GCACTTIGTC ATCAITCrrC ATOGACTCCT 1140 

TTCACTCCTT TAACAAAAAC CTTQCTTCCT TATOCCACCT GATCCCAGTC TGAAGC?rCTC 1200 

TTAGCAACTG GAGATACAAA GCAAGGAGCT GGTGAGCCCA GCX3TTGACGT CAGGCAGGCT 1260 

ATGCCCTTCC GTOGriTAATT TCTTCCCAGG GGCTTCCACG AGGAGIXXCC ATCTGCOCCG 1320 

CCXXrrrCACA GAGCGCCCGG GGATTCCAGG CCCAGGCCTT CTACTCTCCC CXTTOGGGAAT 1380 

GTOTCCCCTG CATATCTTCT CAGCAAIAAC TCCATQQQCT CTGQGACCCT ACXXCTTCCA 1440 

ACCTTCCCTG CITCTGAGAC TTCAATCTAC AGCCCACXnx: ATCCAGATQC AGACTACAGTr 1500

CCCTQCAATT GGGTCTCTQG CAGGCAATAfi TTGAAGGACT CClXJi'lVCGT TQGQQCCAGC 1560 

ACACCXX3GAT GGA3X3GAGGG AGAGCAGAGG CXnTTGCTTC TCTGCCTACG TCCXX:TEAGA 1620 

TGGGCAQCAG AGGCAACTCC CXXATCCTTT GCTCTGOCTG TCRGTOCTCA GAGOGGTGAG 1680 

CGAGGrPGGGT TQGAGACTCA GCAGGCTCCG TGCAGCCCTT GGGAACAGTC AGAGGTTGAA 1740 

GGTCATAACG AGAGTGGGAA CTCAACCCAG ATCOOGCCCC TCCTGTCCTC TGTGTTOOOG 1800

CGGAAACCAA CCAAACCGTG CGCTGTGACC CATTGCTGTT CTCTGTATCG TGATCTATCC 1860 

TCAACAACAA CAGAAAAAAG GAATAAAATA TCCTTTGTTT CCTAGTGAAA AAAAAAAAAA 1920 

AAAAAAAAAA AAAAAAAAAA CTCGA 1945 



(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2933 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

GGGTOGACCC ACGCGTCCGG CAGCCGTCGT TTGAGTOGTT GCTQCCGCTG CCCCCTCCCG 60 

GATCAGGAGC CAGrTGTATAC CGCCCGCCCA CCGCCTTQGT GCCGCTAGAG GAAACGAGAA 120 

GGAGGCOGCC TGCGGTITGT OGCCGCAGCT CGCCCMCYGY CYGGRAGAGC CGAGCCCCGG 180

CCCAGTCGGT CGCYTGCCAC CSCTCGTAGC CGTTACCCGC GGGCCGCCAC AGCOGCCGGC 240 

CGGGAGAGGC GOGCOCCATG GCYTCTGGAG CCGATTCAAA AGGTGATGAC CTATCAACAG 300 

CCATTCTCAA ACAGAAGAAC CGTCCCAATC GGTTAATTGT TGATGAAGCC ATCAATGAGG 360 

ACAACAGTCT GGrPGrrCCTTG TCCCAraXA AGATQGATCA ATTQCAGTTG TTCOGAGGTG 420 

ACACAGTGTT GCPGAAAGGA AAGAAGAGAC GAGAAGCTGT TTGCATCGTC CTTTCTCATG 480

ATACTTGnTC TGATGAGAAG AITCGGATGA ATAGACTTGr TCGGAATAAC CTTCGTOTAC 540

GOCTAGGQGA TGrTCATCAGC ATCCflGOCAT GCCCTGttlGT GAACTAOGQC AAAOTEATCC 600 

ATGrrGCTGCC CATTSATCAC ACACrTGGAAG GCATTACTGG TAATCTCTTC GAGGTATAOC 660 

TTAAGCOSTA CTTOCTGGAA GOGTATCGAC CX:ATCCGGAA AGGflGACATT TTTCTTGrOC 720 

GTGGnOGGAT GCX3TOCTCTG GAGTICAAAG TOGTQGAAAC AGATCCTAGC CCTTATTQCA 780

TTGTTGCTCC AGACACAGTG ATCCACTOCG AAGGGGAGCC TATCAAACGA GAGGATGAGG 840 

AAGACTOCTT GAATGAAGTA GGCTAITGATG ACATTGGrTGG CTGCAOGAAG CAGCTAGCTC 900 

AGATAAAGGA GA!lX3G?rGGAA CTGOCXXTGA GACATCCTGC CCTCTTTAAG GCAATTGGrTG 960 

TGAAGCCTCC TAGAGGAATC CTGCTTTACG GACXrTCCTGG AACAGGAAAG ACCCTGATTG 1020 

CTCGAGCrGT AGCAAATGAG ACTGGAGCCT TCTTCTTCTT GATCAATGGT CCTCAGATCA 1080

TGAGCAAATT GGCTGGTGAG TCTGAGAGCA ACCTTCGTAA AGCCTTTGAG GAGGCTGAGA 1140

AGAATGCTCC TQCCATCATC TTCATTGATG AGCTAGATGC CATOGCTCCC AAAAGAGAGA 1200 

AAACTCATOG CGAGGTGGAG CGGCGCATTG TATCACAGTT GTTGACCCTC ATGGATGQCC 1260 

TAAAGCAGAG GGCACATGTG ATTGTTATGG CAGCAACCAA CAGACCCAAC AGCATTGACC 1320 

CAGCTCTACG GCGATTTQGT aSCTTTGACA GGGAGGTAGA TATTGGAA3T CCTGATGCTA 1380

CAGGACGCTT AGAGATTCTT CAGATCCATA CCAAGAACAT GAAGCTGGCA GATGATCTQG 1440 

AOCTGGAACA GTAGCCAATG AGACTCACGG GCATOrGGCST GCTGACTTAG CAGCCCTCTG 1500 

CTCAGAGGCT GCTCTGCAAG CCATCCGCAA GAAGATGGAT CTCATTGACC TAGAGGATGA 1560 

GACCATTGAT GCCXSAGGTCA TGAACTCTCT AGCAGrTTACT ATGGATGACT TCCOSTGGGC 1620 

CTTGAGCCAG AGTAACCCAT CAGCACTGCX5 GGAAACCX3TC GTAGAGGTGC CACAGGrTAAC 1680

CTGGGAAGAC ATCGGGGGCC TAGAGGATGT CAAAOSTGAG CTACAGGAGC TGGTCCAGTA 1740 

TCCTGrrOGAG CACCCAGACA AATTCCTOAA GrTTTOGCATG ACACCTTCCA AGGGAGTTCT 1800 

GITCTATGGA CCTOCTGGCT GTOGGAAAAC TrTGTTGGCC AAAGCCATTG CTAATGAATG 1860 

CCAGQCCAAC TTCATCTCCA TCAAGQG?rCC TGAGCTQCTC ACCA3CTGGT TTGQQGAGTC 1920 

TGftGGCCAAT GTCAGAGAAA TCTTTGACAA GGCCCGCCAA GCTGCCCCXT GTGrTGCTATT 1980

CTTTGATGAG CTGGATTCGA TTGCCAAGGC TCGTrGGAGGT AACATTGGAG ATGGrrOGrTGG 2040

GGCTGCTGAC CGAGTrCATCA ACCAGATCCT GACAGAAATG GATGGCATGrr CCACAAAAAA 2100 

AAATGTOrrC ATCATTGGCG CTACCAACCG GCCTGACATC ATTGATCCTG CCATCCPCAG 2160

ACCTQGCrGT CTTGATCAGC TCATCTACAT CXXACTTCCT GATGAGAAGT OCCCTGnTGC 2220 

CAICCTCAAG GCTAACCTGC GCAAGTCCCC AGTIGCCAAG GATGrTGGACT TGGAGTTCCT 2280

GQCTAAAATG ACTAATGGCT TCTCTGGAGC TGACCTGACA GAGATTTGCC AGCGTOCTTG 2340

CAAGCTQGCC ATCCXTTGAAT CXOTOGAGAG TGAGA3TAGG CGAGAACGAG AGAGGCflGAC 2400 

AAACCCATCA GOCATGGAGG TAGAAGAGGA TGATCCflGTG CCTGAGATCC GTCGAGATCA 2460 

CITTGAAGAA G0CATC3CGCT TTOCGCOCCG TTCTgrCAgT GACAATGACA TTOGGAAGTA 2520 

TQAGATGTTT GCCCAGACCC TTCAGCAGAG TCGGGGCTTT GQCAGCTTCA GATTCCCTTC 2580

AQQGAACCAG GGTGGAGCTG GCCCCAGTCA GGGCAGTrOGA GGCGGCACflG GTOGCAGTCT 2640 

ATACACAGAA GACAATGATO ATtaWXTGTA TOGCTAAGTG GTOGTOGCTA GCGrTGCfiGTC 2700 

AGCTQGCCTG CXTOGACCTT CnTCCCTGGG GGTQGQGGCG CTTGCCCAGG AGAGGGACCA 2760 

GGGGrrOCGCC CACAGCCTGC TCCATTCTCC A3TCTGAACA GTTCAGCTAC AGTCTGACTC 2820 

TGGACAGGGG GTTTCTGrrTG CAAAAATACA AAACAAAAGC GATAAAATAA AAGOGATTTT 2880

CATTTOGTAA AAAAAAAAAA AAAAAAAAAT CXXX3GGGGQG GCCCGAACCA TTT 2933 

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

TCGGAGAGCC GGCQGGOGNG CGCCTCTCGG CCAGGAAGCG CCTCTTGGAC GCGTGTNACC 60

GATGCCCAGA AGTGGCCTTG GGCTC3GGGAT CACCATAGCT TTTCTAGCTA CGCTGATCAC 120

GCAGTTTCTC GTGTATAATG GrGTCTATCA GTATACATCC CCAGATTTCC TCTATATTCG 180

TTCTTGGCTC CCTTGTATAT TTTTCTCAGG AGGOGTCACG GTGQQGAACA TAGGAOGACA 240

GTTAGCEATG GGTGTTOCTG AAAAGCCCCA TAGTGATTGA GTCTTCAAAA CCAOOGATTC 300

TGAGAGCAAG GAAGATTTTG GAAGAAAATC TGACTGrTGGA TTATGACAAA GATTATCnT 360

TrrCTTAAGT AATCTATTTA GATCX3GGCTG ACTGTACAAA TGACTCCTGG AAAAAACTCT 420

TCACCTAGTC TAGAATAGGG AGGTGGAGAA TGATGACTTA CCCTGAAGTC TTCCCTTGAC 480

TGCCCGCACT GGCGCCTGTC TGTGCCCTGG AGCATTCTQC CCAGGCTACG TGGGTTCAGG 540

CAGGTC3GCAG CTTCCCAAGT ATTOGATTTC ATTCATOTGA TTAAAACAAG TTGCCATATT 600

TCAAAGCCTT GAACTAAGAC TCAATTACCA ACCCGCAGTT TTGrTGTCAGT GCCCAAAGGA 660

GGTAGGnTCA TGGTGCTTAA CAAACATGAA GTATGGTGTA ATAGGAATAA TATTTATCCA 720

AAAGATTTTT AAAAA3!ftGGG dCTCTTTAA AAAAAAAAAC AAAACARGAA AAGCftGCaCT 780

GATTATAGftG AGGTCACftCT CTAAGTOGGG TCGOGGCGTG GCCA0C5CTTC AOGGTCAOSC 840 

TOSTCCGTCC TQCAGTGGCX5 TGTITACATG GTrCACAOGrPG TGTGrrATCAC CAGTOGGTCA 900 

ACTGCITOrC ATTCCTOXG TQQCAGTTTG TGTAGftCAAT CTTACTGAGC AAAAGGCAAT 960 

GAAAAGTCTT GGTHXCACA CTGOGftlATA TTQGAATTTT CACCTCflGTT TATGAAGTTT 1020 

ATTTCGAAAT CCATACTCAT CTAAGAATGA ATACCTGTCT GCCMGrKATT TCAATCTTAG 1080 

TGAGCCAAAA TTGTTTOTrr GTTACTACAG AATAGAGATG ACTOTTTTTT GCCACAGCCC 1140 

TATGOIATTT GCAATCTGrPG ATTGtXTTGT AAAAAGGAGA GTGCATATGG CACTGCATTA 1200

AACX?rGTQGT GrTTTCTACTC AATGATATTG GTGAGCACAA TGrPATTCATT TAATGGCATA 1260 

GACCATACCA GACCTAATTT GCAASTATTG GGTCTTAAAC TTCAAGTOCA ATGTATATGA 1320 

AAACCAATCT GAGCCTTGTA TCTCTTAAAT ATTTATTTTT TTTAAOGTCT GAGATGTTOG 1380 

AGAGAAGGTT CTCCATTCAT TTCAGTQCTG CXTTOGAQGAA ACTCGGCAAT GATTTCTTTC 1440 

AfTTTGrrGAAG TTCCTTTCGT GTTACACCCT CCACTGAACC CKMCCTTC GAAATACTCC 1500

AGTTTTGTC3G GTTTGGTCAT TTTTACTTAT AAATTTACCT TTTTGTAarrT TGCAATTTAC 1560 

ATGTGrrTTGG TITGmTAA ATTCTGrTGAA AGTQQCTTGA TTAAAAGACT CXTTTTAAAT 1620 

GGAAGCCACC AGTCAGCAGA ATC3GAAGCTT AGAOGAACTT GCCTGTGAGC GCTGGTCTTT 1680 

GTGnTGGTT TTGTGATGTA ACGATCTTTG CTQGQGTTTT ' I ' i Xj Cn ' ini r r T TTGAGQGAAA 1740 

TGTCTrGGAG TAAATTTTAA GrTTCCTGGftG TTAATTICTT TTACAGGAAT TTTGrmTTr 1800

AAAAAAATAG GATCATTCTG AftCTTTQGAA TCa^CCCCCTT ATAaMTTTC TGAAAATGAA 1860 

AflCAGTTACA TGAAAAAAAT TTCCAATGAA GAICTCAGCA TTTTATGAAA AACCAGAAGT 1920 

TATTAGAITGA AAGCAGOGAG TGAATCTTTA AAACflGACTT GATCACGCAC ACACAATAAG 1980 

TCTTTCTCTC CGAAACCGGA AGTAAATCTA TATCTGrTTAG AAATAATGTA GCCAAAAGAA 2040 

TCTAAATTTG AGGATTTTTT TGCCAATAGT TTATAGAAAA TATATCAACC AAACTGftTTT 2100

GAGTTTGTAA AAATGTAAAA TAGTATGAAC AAAATTTOCA CTCTACCAGA TTTGAACRTC 2160 

TflGTGAGGTT CACATTCATA CTAAGTTTTC AACATTGTGT TCTTTTTGCA TTCATTTTTT 2220 

ACTTTTATTA AAGGTTCAAA ACX: 2243 



(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

OGTGOGNAGC TTnCATTTC COCTTACWGG UUCUL'l'NTAA OOGGAAACCT TCCOQGAATT 60 

TrOQQGTOGA COCACGOGTC CX3GCCAGCCT AQGftGAAGAA CTTCGTftGTC CCAGAGGTCA 120 

GGCAGGADGC GGCAGTTTCT GGCGGGTCAG GGCXX^GCTG AAGTGACAGC GGAGGOQGAA 180

OCAAOQGrrCG aiCGOQCGGA GAAOOGOGCT OOCCCCAOGA GGAGGAOGAA AQXTTCCGA 240 

GAAAACAGCA ACAAGCTSAG CTGCTGTGAC AG/yOGOGAAC AAGATGGCGG OGCOGAAGOG 300 

GAGCXTCTGG GTGAGGACCC AACTQGGGCT CCCGCCGCTG CPGCTGCrGA CCATOGCETT 360 

GGCXX3GAGGT TCX3QGGACCG CTTOGGCTGA AGCATTTGAC TCQGTCTTGG GTGATADQQC 420 

GrrCTTGCCAC CGGQOCTGTC AGTTGAOCTA CXXXTTTGCAC ACCTACCCTA AGGAAGAAGA 480

GITGTAOGCA TGTCAGAGAG CnTGCAGGCT GmTCAATT TGTCAGmTG TGGATGATQG 540 

AATTGAdTA AATC3GAACTA AATTQGAATG TGAATCTGCA TGTACAGAAG CATATTCCCA 600 

ATCTGATGAG CAATATGCTT GCCATCTTQG KTGCCAGAAT CAGCTQCCAT TCGCTGAACT 660 

GAGACAAGAA CAACTTATGT CCCTGATGCX: AAAAATGCAC CTACTCTTTC CTCTAACTCT 720 

GCTGAGGTCA TTCTGGAGTC ACATGATGGA CTCCGCACAG AGCTTCATAA CCTCTTCMG 780

GACTTTTTAT CTTCAAGCOG ATCACGGAAA AATAGTTATA TTCCRGTCTA AGCOCAGRAA 840 

TCCCAGCTAC GCAOCACAirT TG6AGCCAGG AGCXXTTACCA AATTTGRGRG RAWdfTCTCT 900 

AAGCAAAATG TGCNTCAKMT CGSMAATGAG AAATTCACAA GCGCACAGGA ATTTTCTTGA 960 

AGATGC3AGAA AGTGATQGCT TTTTAAGATG CCTCTCTCTT AACTCTQGGT GGAmTAAC 1020 

TACAACTCTT GTCCTCTCGG TGATOGrCATT GCTTTOGATT TGTTGTGCAA CTTGTTGCTA 1080

CACGCTGTTG GACX3CAGTAT AGTTTCCCTC TGAGAAQCTG AGTATCTATC GTPGACTTGGA 1140 

GTTTATGAAT GAACAAAAGC TAAACAGATA TCCAGCTTCT TCTCTITOrOG TTGTTAGATC 1200 

TAAAACTQAA GATCATGAAG AAGCAGGGCC TCTACCTACA AAAGTGAATC TTGCTCATTC 1260 

TGAAATTTAA GCATTTTTCT TTTAAAAGAC AAGTGTAATA GACATCTAAA ATTCCACTCC 1320 

TCATAGAGCT TTTAAAATGG TTTCATTQGA TATAGGCXTTT AAGAAATCAC TATAAAATGC 1380

AAATAAAGTT ACTCAAATCT GTTGAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAG00GG6 1440 

GCCCGTTACC AAKTCGCCCT ATWGTGAETB GTATTOITAT TTTACTAATA TCTGTAGCTA 1500 

TTTTGTITrT KGCTTKGGTT ATKGTTTTTy TCCCTTYTCT WAGCTATRAG CTGAIXMKG 1560 

CYSCTTCTCA CXTCCTGCCA TGATACTGTC AGTTACCTTA GTITAACAAGC TGAATATTTA 1620 

GTAGAAATGA TGCTTCTGCT CAGGAATGGC CXIACAAATCT GrEAATTTGAA ATTTAGCAGG 1680

AAATCAOCTT TAATGACACT ACATTTTCAG GAACTTGAAAT CAOTAAAATT TTATTTGAAT 1740

AATTATGTCC TGAAAAAAAA AAAAAAAAAA AMWMRARASK RKWWACTCGA GOOOGOOCOC 1800 

GGTAOCOOVT TCX3CXX3 1816 

(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 945 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

AGAAACCGTT GATQGGACTG AGAAACCAGA GTTAAAACCT CTTTQGAGCT TCTGAGGACT 60 

CAGCTGGAAC CAAOGGGCAC AGTTGGCAAC ACCATCAACT TCTCCCAAGC AGAGAAACOC 120 

GAACOCACCA ACCAGGGGCA GGATAGCCTG AAGAAACATC TACACGCAGA AATCAAAGTT 180

ATTGGGACTA TOCAGATCTT GTGTOGCATG ATOGTATTGA GCnTCGQGAT CATTTTGOCA 240 

TCTGCTTCCT TCTCTCCAAA TTTTACCCAA GTGACOTCTA CACTGTTGAA CPCTGCTTAC 300 

CCATTCATAG GACCCTTTTT TTTTATCATC TCTGGCTCTC TATCAATCGC CACAGAGAAA 360 

AGGTTOACCA AGCTTTTGGT GCATAGCAGC CTGGTTGGAA GCATTCTGAG TGCTCTCTrCT 420 

GCCCTGGrrGG GTTTCATTAT CCTGTCTGTC AAACAGGCCA CCTTAAATCC TGCCTCACTG 480

CAGTGTGAGT TQGACAAAAA TAATATACCA ACAAGAAGTT ATCTTTCTTA CTTTTATCAT 540 

GATTCACTTT ATAOCftOGGA CTGCTATACA GCCAAAGCCA GTCTC3GCTQG AWCTCTCTCT 600 

CTGATQCTGA TTTGCACTCT GCTGGAATTC TGCCTAGCTG TGCTCACTQC TGTGCTGOQG 660 

TGGAAACAGG CTTACTCTGA CTTCCCTQGG AGTGTACTTT TCCTGCCTCA CAGnTACATT 720 

GGTAATTCTG GCATGTCCTC AAAAATGACT CATGACTGTG GATATGAAGA ACTATTGACT 780

TCTTAAGAAA AAAGGGAGAA ATATTAATCA GAAAGTTGAT TCTTATGATA ATATGGAAAA 840 

GTTAACCATT ATAGAAAAGC AAAGCTTGAG TTTCCTAAAT GTAAGCTTTT AAAGTAATGA 900 

ACATTAAAAA AAAOCATTAT TTCACTOTCA TTTAAAGATA ATOTG 945 



(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 902 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

GGCAGAGCCA CAGGEAAGGAT GAGGAAGACC AGGCTCTGGG GGCTGCTGTG GATGCTCTTT 60 

GTCTCAGAAC TCCGAGCTGC AACTAAATTA ACTGAGGAAA AGTATGAACT GAAAGAGGQG 120 

CAGACXXTOG ATGTGAAATG TGACTACACX5 CTAGAGAAGT TTOOCAGCAG OCAGAAAGCT 180

TGGCAGATAA TAAGGGACQG AGAGATGCCC AAGACCCTQG CATGCACAGA GAGGCCTTCA 240 

AAGAATTCCC ATCXAGTCCA AGTOGGGAGG ATCATACTAG AAGACTACCA TGATCATGGT 300 

TTACTGCGCG TCCGAATQGT CAACXHTCAA GrTGGAAGATT CTGGACTGTA TCACTGTGTG 360 

ATCTACCAGC CTCCCAAGGA GCCTCACATO CTOTTCGATC GCATCOQCTr GGTOGfTGACC 420 

AAGGGnTTT CAGQGACCCC TQGCTCCAAT GAGAATTCTA CCCAGAATGT GTATAAGATT 480

CCTCCTACCA CCACTAAGGC CTTGTGCCXA CTCTATACCA GCCXXAGAAC TGTGACCCAA 540 

GCrCCACXXJV AGTCAACTGC CXaATGnCTCC ACTCCTGACT CTGAAATCAA CCTTACAAAT 600 

GTCACAGATA TCATCAGGGT 'llXXXJlXJrit.' AACATTGTCA TTCTCCTGGC TGGfTGGATTC 660 

CTGAGTAAGA GCCTGGTCIT CTCTGTCCTG TTTQCTCTCA OGCTGAQGTC ATTTGTACCC 720 

TAGGCXX:ACG AACCCAOGAG AATGrrCCTCr GACTTCCAGC CACATCCATC TOGCAGrrrGT 780

GCCAAGGGAG GAGGGAGGAG GTAAAAGGCA GGGAGTTAAT AACATGAATT AAATCTGTAA 840 

TCACCRGCTA AAAAAAAAAA AAAAAAAACN CGANCX7ITJGG TTTTCAGCTC CATCAGCTCC 900 

TT 902 



(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1883 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

AGAAAACAAC TGAAAAACCA CATTTTTCTA CATACAGCTG GGGAGGTAGC TGAGAACTTG 60 
GCACTGOGCA CACATACTAG GTTGAAAGAG AGTTGAGGAA ACCAGAAGGC CAAGTGGATC 120 
TGCTOGCAAA CCCTGAACCT GTCTCCTGCG CTTGCTCTAC AGTTCTGAAG TTGAAAATCC 180
TTTTCATGCC TAGCATCTGC TTGAGTTATA AACCCCAAGG CAGCCATGTC ATAGACTAGT 240 
GTTTACTCTT GnTTGACTT TGTTTTAATG CTTCCT7UU3A CCCAAGTGCC TCCTGCTGTT 300 

TCCTCCTTTG TOGTAGOCTC TGGCXOTCTG GGACXTTCAAT OCCCAGCnT CCXACTTTCA 360

GCMnCCTTT GCTCTCTTTG CTTCTADCTC AAATAGCCCC AQGAGrTGGGC TTTAGTCTCC 420 

AATATQGAGC ATYTCAAGCT TCTCCTGQGG GATGGGGATT GQC5ATGGGCA GAATCTGrTTT 480 

TQGWrCTCCG GGrTTATTTCC AgTGGGrrG TA AAAGCAGAGC TGGQCCTTTC OCTCTCTTAT 540 

CCCTGAGQOT GGGTAAGAAG GACTOTATCT ACACCTGTTC TTCCCTACCT ' il. '' lVrmXJr 600 

TAGGGAGGCX: TCA3TCTAAG TTCCTCAAGA GAGTCCTTGG CTTAAAGCTG TftGCAAGGGT 660 

GrrGCTAGGTC GGGGATTTGG AQCAAAACOG TOGAGTAGGC ATOATACTQG TATGGAGrTGG 720 

GCXTTGCAAAA TCAGACAGAA ATGGCTTGAG AAGCCX5CAGG GGAGCATGCC TGTCTCTCAG 780 

TQATAGA^QTA TOCSGAGGGAC CTCCCTAOCT TGGAAAATGA GAATTGAA06 OGTTATGAAC 840 

AAATAGGATG CCTAGTTGAG GATGTTCCCA AALJl'lTXUiC CAATCTTATC ATTACTAGAT 900 

nTATAAGCC ACAGAGftCAA ACCAGAAACG GAATAATGTT ACTTTOGATG CTTTATTTTT 960 

TTGTTCTAGG TGrTGGCTTTG TACATGCAGA AGAAIQCTAT ATGCTGCACA TTTTGCCTTT 1020 

AAAGTCTTAC GACTTTCCCC ATTTTAGrTCT AATGGGAAGA TACflGATOIG CAAGTCTQCT 1080 

TTTTTGTTTT TTGnTArTAT TTTTITTTTT 1TOCTCTGTC TTATQGACAT TTTCAGACAT 1140 

GCACAGAAGT GGAGAGGATG GTCCTTGGAC CCGATCTGTC CATCACCTAG CTGCATCACT 1200 

TATCAGCTAT GGTCAADCTG GTTTCATCTG TATCTCTCTC TrTTCACCTG TATTGTTrAT 1260 

TGAAAATCCA AGACACTATO CCAATGCAAC CGTCACTACT TTGGGAGATT GGTAGTCTCT 1320 

TrTGATGGTC ATAGTGATC3G QGTGCACTAT CATAATCACA TCAGGTCPGC ' i ' l ' ri ' mLTi ' l ' 1380 

TAAlXJrTAAC TAATCAAGTT CCAGAGATGG GCCTTAGAAA TCTOTTTTAA GAATTAACAA 1440 

GGAGTCTCAA AAAGAAATGA GAGGGATGCT TCCTTTCCXX: TTOCATCTAC AAAACAAGAG 1500 

AGAGACTGTT CTGTTGTAAA ACTCTTTCAA AAATTCTGAT ATOGTAAOGTr ACTTGAGACC 1560 

CTTCACCAGA ATGTCAAICT TTTTrTCrGT GTAACATGGA AACTICTGTC ACCATTAGCA 1620 

TTGTTATCAG CTPGrACTGG TCTCATAACT CTGC3TTTTGG AAGAATAATT TGGAAATTGrr 1680 

TQCTGTGTTC TGTGAAAATA ACCTCCCCAA AATAATTAGT AACTQGTrGT TCTACTTGGT 1740 

AATTTGACAC CCTGTTAATA ACGCAATTAT TTCTGTGTTC TTAAACAGTA TAAATAGTTG 1800 

TAAGTTTGCA TQCATGATGG AAAAATAAAA ACCTGTATCT CTGTTAAAAA AAAAAAAAAA 1860 

AAAAAAAAAA AAAAAAAAAA AAA 1883

(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2100 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

TACTTTTAGA TTTACTGCCT TCAAAAAGTG CCTATTCTGA GCAACATAAA CGTTATTCCT 60 

TACATATCTA TGTACACflCG GTACCCAGAG TCCTACTOPG GCAGCCTTCA AAAACATACC 120 

ATCAGAAAGA GTAGGTQCTG AGATAAGGNA ACTTTGCCAA ATCNAAGAAA GTCACTCACT 180 

TCCAATATCC CCTCTTCAAG CGGCTACCGT OlAASGGGCT GCAAACACAT TCCCTGftGCA 240 

TOXTTGCTG ATACAGCTTC TITATATTTA TATOCTACTG GATGGTAGCA TATTQCTAAG 300 

GTTTCCTGTA CTCTGCTTCA AGGGAATGTA AGYTTTATGG CATTGAAACA TTEAGGAAAA 360 

AAAAAGATGT TTAflGAGAAT TAAIEAGAGCC GTAGrrCTGTA TTAGGATOTG TGTCATATGT 420 

GrTGrrrCTATA AACTAAQCAT CGCSTGGGTTT AGAGnCTTAA AGTGTCAGCA CATTCCTTCT 480 

CCTTTTGrrCT CTCAGGCTAA CATGAGAGAA AATAGAAAAG TCTTQGCTGrr GGGGATTGGA 540 

AGCTCAGQQG GCXAAATGTC CTTGCCAGAT CTTTAGAGCA TTACTTTGAC TCCTAAAAAT 600 

AGTAGTGTAT GTTATTTGAT GQCTnTGTT TCCATAGTTC CATCACTGAC AAAACTGTCA 660 

ATACTGTTGA TGGAGCAGCA GCATAGCCTA GAGTGATGCA TVCTThCCCA GAGGTQGCAA 720 

TAGGAGAGC3G TCCATOTAAA TAGGACGAGG TAGACAGTGC ATGATTGTAG GAGAAGGGTT 780 

GAAGQGAGGA CATGATTCCA AAAAAGATCG TTCTCAAICT GTCGTCTCAC TCAACCAGCT 840 

GGCAGATTAC ACTTGCX:AAG TCGTTCCCTT TCCTTCTAAG TCAGnTGGCT CCATATTCAC 900 

TTGAATATGC CTCTOTTTOG GCAAAGCAAG ATACXTCCAC TTAACCTTTA TCCAAGGAAG 960 

CiirriuaiTyr cctcttggtc ATAAAGrrrGT ctcctaccta AcxxMrrrrr accaaatqga 1020 

AGTAAAAGGG GACAAACTAT GGAAGATOGA CTCCATGCCA TTGCAGTCAG CTACCATTCT 1080 

CTTTTCCATA TAAGGAGCCC CATTACATAA GCTACGGGTG AGGTTGGAAC AGCTATGTTr 1140 

CATAATTTCA AGAGnCTGAC CACCCTQCTC TAGTrCATCAT CATTGGATGA ATCCAGTTGA 1200 

CTCTrTGGCA AAAGGGTGAT ACTTTTCACT AAAAATGCCT ACTCTTCCTG TTGATGrTTCC 1260 

TTTTCTGrrrr TTACCTTGTC CAATTTCCAC ACTAGTCATT TTTTTTATTT TTTAGAGGAT 1320 

CAGATTTTAG CGCTGGAAAA TGAGTTCAAA AATTTCAGrG TAATCTCATA AGGATGTTGG 1380 

GATACAGAGA UriTriTrrr CXrrrGGAAAC AAATGGACTG GGAAGAAACA CAGCATGGCT 1440 

TTGCTCTGAG TTTCAATCTG AITGATTATGA CCATGGAAGA TAGTCTTATG TAAAOGTTAA 1500 

ATGGTGTTTA CAAGTGGATA GAITAAGGCGG AGATOGTGAG AAGCCGGGTT TTCTCTATOC 1560 

TAAATGriGTC TACTAAGAGC AGCACTTCCT ACTAGCTAAG CACAATCATA GCCCXACCGT 1620

GATCAGCTGC TAGTCTGAAT AACA3rrCCCT GACTEAGQGA AflGQCACACA AAAACA3ATA 1680

AAGAATATGT CTATTTTCAT ATGTGTCATA CTGACAGAGC GATGGTATTC CTAAAAmTA 1740

QGTTTCTCTT TTTTCTTOTA TTCTTAGCAA ATTGCATTTA TTCACEACAT TACAAACCAT 1800

CACTGATGTA TCCAAAATAG CACACATAGT TCAGTATGAA AATAAGAGAA TAAAATCTOT 1860

TATAAGCAAG TGATTTAGGT ATrTTCTTTT GrGrrTTATCC ATTATCTGAC TATATTAAAA 1920

CCTGTrTTTC TATTTACCTT CTATCAGrTTT TCTCTACXAA TT MUi ' mT TCAATGCTCT 1980

ATAAGAATGA ATATQGAAAT TATATTTCTT TTTTCTGTAA AAGAGTTGCA ACTACTTTAT 2040

TATATTTAGA AATCCAATAA ACTTCTTATT ACATTTAAAA AAAAAAAAAA AAAACTOGAA 2100

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1930 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:
CC^^TGA^3TG TGGTCCOGGG TGCNGATTGG CAGCGCCTCC GCOQCGGCTC GTQGrrTGTOC 60

CGCCATGGCA CTGTOGOQC3G GGCTGCOCOG GGAGCTGGCT GAQGCGGTGG OOOGGGOCOL; 120

GGTtXTOGTG GTQGGGGCGG GOGGCATCGG CTGCGftGCTC CTCAAGAATC TOGTCCTCAC 180

CGGTTTCTCC CACATCX3ACC TGATTGATCT GGATACTATT GATGTAAGCA ACCTCAACflG 240

ACAGriTTTTG TTTCAAAAGA AACATGTTOG AAGATCAAAG OCACAOGTTG CCAAOGAAAG 300

TCJTACTGCAG TTTTACOCGA AAGCTAATAT OGTTGCCTAC CATGACAGCA TCATGAACOC 360

TGACTATAAT GTGGAATTTT TCCGACAGTT TATACTGGTT ATGAATQCTT TAGATAACAG 420

AGCTGCCCGA AACCATGTTA ATAGAATGTG CCTGGCAGCT GATGTTCCTC TTATTGAAAG 480

TQGAACAGCT GQGTATCTTG GACAAGTAAC TACTATCAAA AAGGGTGTGA CCGAGTGTTA 540

TGAGTGTCAT CCTAAGOOGA CCCAGAQAAC CTTTCCTOGC TGTACAATTC GTAACACAOC 600

TTCAGAACCT ATACATTGCA TCGTTTQGGC AAACTACTTG TTCAACCAGT TGnTTGOGGA 660

AGAAGATGCT GATCAAGAAG TATCTCCTGA CAGAGCTGAC CCTGAAGCTG CCTGQGAACC 720

AACGGAAGCC GAAGCCAGAG CTAGAGCATC TAATGAAGAT GGTGACATTA AACGTAXTTC 780

TACTAAGGAA TGCX3CTAAAT CAACTGGATA TGATCCAGrTT AAACTTTTTA CCAAGCTTTT 840

TAAAGATGAC ATCAGGTATC TGTTGACAAT GGACAAACTA TQGOQGAAAA QGAAACCTCC 900

AGrrrCOGTTG GACTOGGCTG AACTACAAAG TCAAGGAGAA GAAACXaATG CATCAGATCA 960

ACAGAATGAA CXXXAGTTAG GCCTGAAAGA OCAGCAGGTT CTAGATGTAA AGAGCTATGC 1020 

AOGTCTTrrr TCAAftSAGCA TOGAGACTTT GAGAGTTCAT TTAGCAGAAA AQQOQGATGG 1080

AOCTGAGCTC ATATGGGATA AOGATGACCC ATCTGCAATG GATTTTGTCA CCTCTOCTGC 1140 

AAACCTCAGG ATGCATATTT TCAGTATGAA TATGAAGAGT AGATTTGATA TCAAATCAAT 1200 

QGCAGGGAAC ATTATTCCTG CTATTQCTAC TACTAATGCA CTTAATTQCTG GGTTGATAGT 1260 

ATTQGAAGGA TTGAAGATTT TATCAGGAAA AATAGACCAG TQCAGAACAA TTTTTTTGAA 1320 

TAAACAACCA AACCCAAGAA AGAAGCTTCT TGTGCXnTCT GCACTGGATC CTCCCAACCC 1380

CAATTOTPAT GTATGTOXA GCAAGCCAGA GGTGACTGTC OGGCTGAATG TCCATAAAGT 1440 

GACTCTTCTC ACCTTACAAG ACAAGATACST GAAAGAAAAA TTTGCTATGG TAGCACCAGA 1500 

TGTCCAAATT GAAGATQGGA AAGGAACAAT CCTAATATCT TCCGAAGAGG GAGAGACGGA 1560 

AGCTAATAAT CACAAGAACT TGTCAGAATT TGGAATTAGA AATGGCAGCC GGCTTCAAGC 1620 

AGATGACTTC CTCCAGGACT ATACTTTATT GATCAACATC CTTCATAGTC AAGACCTAGG 1680

AAAGGACGTT GAATTTGAAG TTCTrTGGTGA TGCCXXX3GAA AAAGrPGGCSGS CCAAACAAGC 1740 

TGAAGATGCT GCCAAAAGCA TAACCAATGG GCAGTGATGA TGGGAGCTTC AGCXXTCCAC 1800 

CTYCACAGCT TCAAGGAGGC AAGATGGACG TYTCYCATAG TTGATYCGGR TGAAGAAGRT 1860 

TCTCCAATAA TTGCCCX3ACX; TTCAITGAAG GAAQGAGGAG GAGGCCCX3CC AAGAGCSQGAA 1920 

TTTAGGNTTG 1930



(2) INFORMATION FOR SEQ ID NO: 173:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1509 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

GGCCCTGGCC TCTQGGCTGA GGCTTGCTAG GGACTCGGGG TGGCTCTAAG GGGCAiCSGGAT 60

AGGGCTGGGG AGCGCCGGCC TGrTGQCCCTG ACCAGCCCCT TCTCGTOCRG GTTCCACCCC 120 

GATGCAGGTG GTCACGTGCT TGACC3CQGGA CAGCTACCTG ACGCACTGCT TCCTCCAGCA 180 

CCTCATGGTC GTGCTGTCCT CTCTC3GAACG CACGCCCTCG CCGGAGCCTG TTGACAAGGA 240 

CTTCTACTCC GAGTTTGGGA ACAAGACCAC AQGGAAGATG GAGAACTACG AGCTGATCCA 300 

CTCTAGTOGC GTCAAGTTTA CCTACCCCAG TGAGGAGGAG ATTGGGGACC TGACGTTCAC 360

TOTQGCOCAA AASAUXSGCTG AGCCAGAGAA GCXXXCAGCC CTCAGCATCC TGCTGTACGT 420

GCAGGCCTTC CAGGTGGGCA TGCCACCCCC TXX3CTGCTGC AGQGGCCCCC TGCGCCOCAA 480 

GftCACTCCTG CTCACCAGCT CCGaGATCTT CCTCCTQGAT GAQGACTGT G TCCACTACCC 540 

ACTGCCCGAG TTTQCCAAAG AQCOXXXXrA GAGAGACAGG TACOGGCTGG ACGATGQCOG 600 

CCGCGTCCGG GACCTGGACX: GAGTGCTCAT QGGCTACCAG ACCTACCCGC AGCCCTCACC 660 

CTCGTCTTCG ATGACGTGCA AGGTCATGAC CPCATGGGCA GTGTCACCCT GGACCACTTT 720 

GQQGAGGrGC CAGGTOGCCC GGCTAGAGCC AGCCAGGGCX: GTGAAGTCCA GTOGCAGGTG 780 

TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG CTCATCTCGC TGTTQGCTOG CCAGTQC3GAG 840 

GOCCTGTCTG GOCGTGAGCT GCCTGTOGAG CTCACOGGCT AGCXXAGGCC ACAGCCaQGC 900 

TGTCGTGTCC AGCXTGACGC CTACTQQQGC AGGGCAGCAG GCTTTTGICT TCTCTAAAAA 960 

TOTTPrATCC TCCCTTTOGT ACdTAATTT GACTGTCCTC GCAGAGAATG TGAACATCTC 1020 

TGTGTGTTGT GTTAATrCTT TCTCATOTTG QGAGrTGAGAA TGCCGGGCCC CTCAGGGCTG 1080 

TCGGTOTGCT GTCAGCXTTCX: CACAGGTGGT ACAGCCGTGC ACACCAGTCT OGTOTCTOCT 1140 

GrrrcrrGGGAC CGTTGrrCAAC AOnGACACT GTGGGTCTGA CTTTCTCriC TACACGTCCT 1200 

TTCCTGAAGT GTCGAGTCCA GTCCTrTGTT QCTGTTGCTG TTGCTG?rTGC T Crr mC ltr r r 1260 

GGCATCTTGC TGCTAATCCT GAGGCTGGTA QCAGAATGCA CATTQGAAGC TCCCACCCCA 1320 

TATTGTTCTT CAAAGTQGAG GTCTCCCCTG ATCCAGACAA CTQGGAGAGC CXOTGGQQGC 1380 

AGGGGAOCTG GAGCTGCCAG CACCAAGCX3T GATTCCTGCT QCXTTGrTATTC TCTATTOCAA 1440 

TAAAGCAGAG TTTGACACCG TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AITOCTGCGG 1500 

CCTCAAGGG 1509 



(2) INFORMATION FOR SEQ ID NO: 174:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:
TOGACCOCAS GOGTCCGTGC TTTICCACAG AAGGTTAGAC CCTGAAAGAG ATQGCTCAGC 60
ACCAOCTATG GATCTTGCTC CTTTGCCTGC AAACCTGGCC GGAAGCAGCT GQAAAAGACT 120
CAGAAATCTT CACAGTGAAT GGGATTCTQG GAGAGTCAGT CACTTTCCCT GTAAATATCC 180
AAGAACCAOG GCAAGTTAAA ATCATTGCTT GGACTTCTAA AACATCTOTT GCTTA3CTAA 240
CaCCflGGAGA CTCftGAAACA GCACCCGTAG TTACTGTGAC CXIACAGAAAT TATTATGAAC 300
GGATACATGC CTTAGGTCCG AACTACAATC TGGrrCATTAG OGATCTGAGG ATGGAAGAOG 360
CAGGAGACTA CAAAGCAGAC ATAAATACAC AGQCTGATCC CTACACCACC ACCAAGOGCT 420
ACAACCTGCA AA3CTATCGT CQGCTTGGGA AACCAAAAAT TACACAGAOT TTAATOGCAT 480
CTGTCAACAG CACCTGTAAT GTCACACTGA CAOXSCTCTCTr AGAGAAAGAA GAAAAGAATG 540
TGACATACAA TTQGAGTCCC CTGGGAGAAG AGGGTAATGT CCTTCAAATC TTCCAGACTC 600
CTGAGGACCA AGAGCTGACT TACACGTGTA CAGCCCAGAA CCCTGTCAGC AACAATTCTG 660
ACTCCATCTC TGCCCGGCAG CTCTGTGCAG ACATCGCAAT GGGCTTCCX3T ACTCACCACA 720
CCGGGTTQCT GAQOGTGCT G GCTATGTTCT TTCTGCrrGT TCXCATTCTG TCTTCAGrPGT 780
nTTGrrrccG tttgttcaag agaagacaag ATGCTGCCTC AAAGAAAACC ATATACACAT 840
ATATCATGGC TTCAAGGAAC ACCCAGCCAG CAGAGTCCAG AATCTATGAT GAAATCCTGC 900
AGrrcCAAGGT GCTTCCCTCC AAGGAAGAGC CAGTGAACAC AGTTTATTCC GAAGTGCAGT 960
TTGCTGATAA GATC3QGGAAA GCCAGCACAC AGGACAGTAA ACCTCCTQGG ACTTCAAGCT 1020
ATGAAATTGT GATCTAGGCT GCTGOGCTGA ATICTCCCTC TGGAAACTGA GTTACAACCA 1080
CCAATACTOG CAGOTTCCCT GGATCCAGAT CTTCTCTGCC CAACTCTTAC TC3GGAGATTG 1140
CAAACTGCCA CATCTCAGCC TGTAAGCAAA GCAGGAAACC TTCTGCTGQG CATAGCTTGT 1200
GCCTAAATGG ACAAATQGAT GCATACCCTT CXTOAAATGA CTCCCOTCTG AATGAATCAC 1260
AAAGCAGGTT ACCTAGTATA GTTTTCCCAA ACTTCTTCCX: ATCAIAGCAC ATGTAGAAAA 1320
TAAIATTTTT ATGGCACACT GQGATAAACA AGCAAGATTG CTCACTTCTG GAAGCTGCAT 1380
ATGACTAGAG GCCTCTTGTG ACTGGAQGTA ACAACXXTGC CXaVGTAACTG TGGGAGAAGG 1440
GGATCAATAT TTTGCACACC TGTAATAGGC CATQGCACAC CAGCCAAGAT GCTCTGCTCA 1500
CAGTCAGTAT GriCTGAAGAT CCCTGGTGCG TGGCXnrCAC CACGCATCTT GAGCAAATTA 1560
GGAAAATGTA CCCTTCGCTT GAGGCAGATG CAGCCCTTCC CXXGAGTQCA TGGCTTGGAG 1620
AGCAGAATGT GGGCTGCATA TAAGCACACT CATCCXTTTTG TCTGGGAATC TTTGTCCAGG 1680
GCATAACAGG CTTAGTAAGT CCAAACACAG ATGACAGTGC TGTGTGGGTC TCTGrrCAGAG 1740
TTGTQGCrCT CAGCCATGTA GACACACTCT CCAAATGGAG TGTTGGAAAA TGrrTCTTXCr 1800
GCAGGGTCTA GAGACTGCTG GGACACmT CTTGGAGTGC TACTTCAGAA GCCTTATAGG 1860
ATTTTCTTTC TGGCCAAGAT TTCXnTCTGT ATCACTOCAA GCAGCCTCAG CAGAAGAAGC 1920
AGCCATGCCC ACTTATTCCCA CTCTCCAAAA GGAACTGACC AGCTTATATT TCTCACACTT 1980
CTOGGGAACT GGGTATAATC CAACCATCAA AATAGAAGAC CTIQCAflGAA GCAGAGTCAT 2040
TCTCCAGAAG GAACTTQGGA GATGATGGTG CAGATGATGA AACTGGG1TC ATCCCAGTTC 2100
CAAAGACTCA GAGAACTAGA GTTTAAGCTG AGGCAGAGTG CCGCCACCCT GGCATQCCCC 2160
ACAAACAGAT CACCAGCCAG CTTACACAGG CATTAACPCT CCTCAATGAG GAAGAATCAT 2220
TCACAACTGA GCAAGACATT CATATGATCA TTTAAGGAAG ' l Ui TlXX V rr ATGTGfTTAGC 2280
AAGTATAATC GGCTAACTCC TAAATCCCAA TGAATAGTCC TAGQCTQGAC AGCAATGGGC 2340
TQCAATTAGG CAGATAAAGA CATCAGTCCC AGTAAATGAA TCCATAGACT CA3CTAQCAC 2400
CAACTACCAT TAGCACTATC TTAGGAGCTG CAAGQCOCCA AAGTAGAAGA TGTOCAfEAAT 2460
GTCTGCTCTT GTGTAGCTCA GGAGACAATT CCAOCACAGA CACTACAGTT AACOCTGAAC 2520
TCSCAGCTGCA ACTAATAGCA TGAACAGTCA GAAAAATACC TTATGAGGGG GCAGQGCTGA 2580
AGCTQQQCCT TGAAGGATGG ATGAAATTTG GATAGAGAAT GAGGAAGACA GAGOtSCCTCC 2640
AAGTGAGAGA AGCATGAAAA ATGAGCAGGG GCCTC3GATCA GTQGGGTGTA TPCAGAGCAC 2700
CTCTCCAGAT GCACCATGCA TGCTCACAGT OCCTTGCCTA TCyPGTGOCAG AGTGrrCCCAG 2760
CCAGATGrrcr GCCCCCACCC CATGTCCATT TACATGTCCT TCAATGCCCA CCTCAAAAGG 2820
TACCTCTTCT GTAAAGCTTT CCCTGGTATC AGGAATCAAA ATTAATCAGG GATCTTTTCA 2880
CACTQCTGTT TTTTCCTCTT TGGTCCTTCT ATCACTAAAA CICATCTCAT TCAGCCTTAC 2940
AGCATAACTA ATTATTTGTT TTCCTCACTA CATTGTACAT GrrGGGAATTA CAGATAAACG 3000
GAAGCCKGCT GGGGTGGTGG CICAOGCCTG TAATCCCAAC ACTTTQQGAG GCCAAGGCAG 3060
GCGGATCACC TGAGGTCAGG APTTCGAGAT TARTCTGGCC AACATGGrTGA AACCCCAIWr 3120
ITEACTAAAAA TACGAAATTA GCCAGGTGTG GTGGCACACA TCTGTAGTCC CAG 3173

(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 991 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

AAATTCGGCA CAGCPGAGAG GAGACACAAG GAGCAGCCCG CAAGCACCAA GTGAGAGGCA 60 

TGAAGTTACA GTGrPGTnCC CTTTCGCTCC TGGGTACAAT ACTGATATTG TGCTCAGTAG 120 

ACAACCACGG TCTCAGGAGA TGTCTGATTT OCACAGACAT GCAOCATATA GAAGAGAGTT 180 





TOCAAGAAAT CAAAAGAGCC ATCCAAGCTA AGGACACCTT CCCAAATGTC ACTATCCTGT 
CCACATTOGA GACTCTOCAG ATCATTAAGC CXrrTftGftlCT GTGCTG0C3TC ACCAACAACC

T0CTOGC3Grrr ctaogtgqac agggtgttca aggatcatca ggagccaaac cxxaaaatct 

TGAGAAAAAT CAGCAGCATT GCCAACTCTT TCCTCTACAT GCAGAAAACT CTGCGGCAAT 
C3TCAGGAACA GAGGCAGTCT CACTGCAGGC AGGAAGCCAC CAATQCCACC AGAGTCATCX; 
ATGACAACTA TGATCAGCTG GAGGTCCACG CTGCTGCCAT TAAATOCCTG GGAGAGCTCG 
ACGTCTTTCT ACXTTOGATT AATAAGAATC ATOAAGTAAT GTCCTCAGCT TGATGftCAftG 
GAACCTGTAT AGTGEATOCAG GGATGAACAC CXXXTGTGCG GTTTACTGTC GQAGACAGCC 
CACCTTGAAG GQGAAGGAGA TQQQGAAGGC CCCTTGCAGC TGAAAGTCCC ACTOGCTCGC 
CTCAGGCTGT CTTATTCXXSC TTGAAAATAG CCAAAAAC7PC TftCTOrOGTA TTTGTAATAA 
ACTCTATCTG CTGAAAGGGC CTC3CAGGCCA TCCTQGGftGT AAAGQC3CTGC CTTCCCASCT 
AATTTATTGT GAAGOXarAT AGPCCATGTC TOTGATGTGA GCCAAGTCAT ATCCTGTAGT 
ACACATPGTA CTGAGTGGTT TTTCTGAATA AATTCCATAT TTTACCTAAA AAAAAAAAAA 
AAAAACTOGA OOOOGGGCXX GTACCCAATT T 

(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:
ACAGCCCTCT TOGGAGCCTG AGCCCGGCTC TCCTCACTCA CCTCAACCCC CAGGCGGCCC 
CTCCACAGQG CCCCTCTCCT GCCTQGACGG CTCTCCTQGT CTCCCCGTCC CCTGGAGAAG 
AACAAGGCCA TGGCJTCGGCC CCTGCTGCTG CCCCTRCTGC YCCTQCTGCW GCCGCCAGCA 
TTTCTQCAOC CTRGTQGCTC CACAGGATCT GGTCCAAGCT ACCTTTATQG GGTCACTCAA 
CCAAAACACC TCTCAGCCTC CATGGGTCGC TCTGTGGAAA TCCCCTTCTC CTTCTATTAC 
CCCTGQGERGT TAGOCAYAGY TCCCRACGTG AGAATATCCT GGAGACGGGG CCACTTOCAC 
GGGCAGTCCT TCTACAGCAC AAGGCCGCCT TCCATTCACA AGGATTATGT GAACCGGCTC 
TTTCTQAACT GGftCAGAGGG TCAGGftGAGC GGCTTCCTCA GGATCTCAAA CCTCCGGAAG 
GAGGACCAGT CTC3TGTATTT CTGCCGAGTC GAGCTGGACA CCCGQAGATC AGQGAGQCAG 
CAGTTQCAGT CCATCAAGGG GACCAAACTC ACCATCAOCC AGGCTGTCAC AACCACCACC 
ACCTGGAGGC CCAGCAGCAC AACCftCCATA GCCGGCCTCA GOffPCACAGA AAGCAAAGGG 660

CACTCAGAAT CATGQCACCT AAGTCTQGAC ACTQCCATCA GGGTTGCATT GGCTGTOGCT 720 

GTOCTCAAAA CTCTCAmT QQQACPQCTG TGCCTCCTCC TCTOrGGTGG AGGAGAAGGA 780

AAGGTAQCAG GGCCCCMiGC AGTGACTTCT GACCAACAGA GTOTOGGGAG AAGCSGASXTTG 840 

TATTAQCXXX! GGAGGAOGTG ATGTGAGACC OGCTTGTCAG TCCTCCACAC TCGTTCOCCA 900 

TTC3GCAAGAT ACATGGAGAG CACCXTTGAGG ACCnTAAAA GC3CAAAGCCG CAAQGCAGAA 960 

GGAGQCTGQG TCCCTGAATC ACCGACTQGA GGAGAGTTAC CTACAAGAOC CTTCATOCAG 1020 

GAGCATCCAC ACTQCAATGA TATAQGAATO AGGTCTGAAC TCCACTGAAT TAAACCACTG 1080

GCATTTOGGG GCTGTTYAIT ATAGCAGTTGC AAAGAGTTCC TTTATCCTCC CCAAGGArPGG 1140 

AAAATACAAT TTATTTTGCT TACCATACAC CCCTTTTCTC CTCGTCCACA TTTTCCAATC 1200 

TGTATQGTGG CTGTCTTCTA TQGCAGAAGG TTrTGGGGAA TAAATAGCGT GANATGWTNC 1260 

TGACINAAAA AAAAAAAAAA AAAAACTCGA 1290 

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2290 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

TGGGGCOCCT TTTQGATC3CT CTC3GGTGTTT TTGCCAAGAG TTACAGGATG TCAAGTGTGG 60 

GGAGCTCAGC ACCCTTGCTG TGGACCAGTG AAGGCTGTTC CAGACCAGGT GCTPCCftGAC 120

ATTTCCAGGC TCCAGGAGAG AGGCTGGGAG CCCCCACAGA AAGCACAGGA AAATGCAAAA 180 

AAAAAACAGT CTTTTTTTTT TTTTTGCTTT TTATTATGAA AACAAAACAA ATGCCCCAGG 240 

AGAAGGGTCC ATGATTACCA GAAACATCAA AGAGTACTTT CTACCATTTT TATTCTGTTG 300 

TGTTGAGGCC AGCATTQCAA TAAACAAGCT AAACTACTTA CATTQGACTC ATTTTCAGTA 360 

ACTGACATTT ACAGGAATAT ACTAGAAACG GCACTAAAAA GTTTAAGAAA AGrTTACGGTA 420

AACTTGCATG CACATCATAC AGAAAAGTTAA CATTTTAAAT ATAAAAAAGA AAAACTTCCT 480 

GGAAGCATTA TGCCAGTATT AAGGAACAGT GCTACTCTGG ATGTGACAAA TTCTGTATGT 540 

GGGTGTTACT CTTTCCCAAA AGftCTGTCAG AGGOGTGAGT GCTOCAAAAG AACAACAACA 600 

AAAACAAACA CACAAAAAAA TGTGTCTTAC AGTTTGTAAG CAAGATGACA CTGCCCAACA 660 

CAAAGAGGGG TCrGGAGTrC AGTICAOGCC CGAAGCCTGC CCCCTCGGCC TCCAGQGGTC 720

ATTCAGAGTG TTCTCAAATC CAATTCOGAC ACAOGACTTG TCACTACTOC TCTCCCCTTG 780

AAAAAAGCAT GTrAGAAGCT GCCCTACAGG TCTCAGCAGT GGGACAATCT AAITGAATCA 840

COGCflGCXTTT CTAATACAGA AGAAftCOGAC GTCACTGrPCA OCCTCAGCXrC GCCAGCAAGG 900

GOGCTGAGGA AGTCATTAAT CdTCGAAAC TCTGAAAAGA AACCAGTGTT GAAGTCTGGA 960

CflGAAAGCCT TAAAAAAGTG ACAGCAOCAA TGCAGCTGCT CAGrTGTACCC NCCGrTGQGCT 1020

GTCAQGGTCA GntGGCTTCrrr TCTAGA3X5AA AGGAGCAGAG GOGAGCCGAC GCCACCGTCA 1080

CAGAGAACCA GCCGAGAAGG AAAQGCCCXA OSATGCrrcCC TOTGCXXTPGC CCXX:ACAGCC 1140

GQCCGCnXC COGACQGCTC ACACAGGCAG CACCTCACTG CCCTGTOGCT GGAGGQQCAT 1200

TGCAAGGEAGC GCCOCXXIAGC CCCAGGCACC CCCGQCTTAG GGnCTACGTA TCACCCAGCC 1260

CTGrrOCTQGC agcacgttac caaccagcct G0C7IGAAGAC CTGTCAACTG TCGTGrPGTGA 1320

ATTCCTTAAA TTCGGTTTAA ATAGTCCATT AAAGATCTGT TTAGAAAATA CCTTTGAAAA 1380

OGAGGGTAAC TTTAAAAAAT GGAAACTTTC AAATCCATTT ATATTTTTAT TATAAACAAA 1440

ACTTAATTAA AAGTTTAACA AACTGGCTGA AAACTCACCA AGTGTCAGAC TCACCAGCAA 1500

TTTAAAAAAT GATAATTTAC CAGCATCTCC TCATCAGAGT TCXXTTCTCCA GTAAGGCTAT 1560

ACCTACATCT GTAAGGGTCA GTGGACTCTG AATCAATTTT ATGGrrPGTTT TAAAATCACC 1620

GTGTATTAGG ATACTAATGA TAGICCCTAT ATCCATCCAG AAATGCTGGC AGAAAGCACT 1680

GGCCACCATA CAQGACAGAC CACACCACAG CTCCATACCC AGCGTCTGCC TOGAGGCTCC 1740

CCCACGCTGA GGTCXGGGAG AATQCCTGGT TTCAGTCATT TCXXSGACTAA CTGrTGACJ^C 1800

GCGTGAGCAG GGAQCACOGT GOGAGTCTCC GGQAGGGAAT CCTCCTGQGG CCCAGAGACT 1860

CCTCCACCCC TGGGGAGGGC AGACAGGCTC OGGARGGCCT GQCXIAGQCCA CTGGAGQCTG 1920

GCAGGGAGCA GQCATGTCCA CCXXXMGCC TQGGAGGCTA ACTCTGGCAT TCCTGGCXX3G 1980

AGCCGCCATG CTCATTGGTG GGCCAGTTTG GGACATOCCC GTACTCAAAG ACXATATGGC 2040

AGCXTCTQGG AAAACAAAAC CAAAACATCA CCTTCTATTA AACTCTGTAT ATTATTATTT 2100

TTTACAATAG AAAGTTAAAA ATCAAGACTT AGATTTACTA TACATTTTTr CTCTCAGATT 2160

ACAAAGTTTA TATTATATAA CTQQGGrrrcC CTAAATTGAT TTCTTTTAAA ACACJTCTTAA 2220

AGAGACCAGA AGTGAATACA AAAGAACTAA ACAAAATAAA AAATTAGAAT GTGCTGTAGC 2280

TGAAAGCTGT 2290



(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:
GGCAOGAGCC ATGCCTQGOC TCTCCTTGAT TCTTACAGTC ACmXHTOG CTGfTTTCTGA 
CTCAGCAGCT ACCTGCATTG TC3CXXAAAGG ATOACCTATT CCTTCTCAGG AGGGCAAAAA 
TGTGGAATAG TGrrCTGTCCA TQCCTCTCCT CATGGGCTAC CACCTCTC3CC AOCGTGGTTA 
ATCAJC?rAACA ACCAGGAGAG AAGCTGCTGG AACTGACCTC TGGGAACTCC CTOGGATOGrT 
TTC3GTGCAGG AATGTAGTAG GCATACACGT GC3TTQCGTOG ATCTCSGGCCC TCCTGATGTG 
AGTAGAGAOG TAAAAOGCCA CCATCTCCTT GACCTCTGGG GAACTCATCC ACAAAGAAGA 
TGTTTCCAAG ATGCTTCTPGA AGATTGCCTA AAAATAGCCG GTTTCCACCC CCGTGAATGC 
ATCCATTCTA GAATGCTCCT TCACCAGGAC CAGAGAACTG ATTTACAGAA GTCACATGAA 
AACATTCCAT CCCAGAATTT GCAGTAGCTC AAATTAAGTT TCTAGCTATT AAAAAGAAAA 
AAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1509 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
GGCAOQAGGG CTCATTCATT CCGCGCCC3GG OCTQCCAGAC ACCTQCGCCC TTCTQCAGCC 
GCCCGCCGCA TCCGCCOCCG CAOOCCCCAG CATGTCGGGC CCAGACGTCG AGACGCCGTC 
CGCCATCCAG ATCTGCOGGA TCATGCGGCC AGATGATC3CC AAOGTOGCCG GCAATC3TCCA 
CGGGQOGACC ATCCTGAAGA TGATCGAGGA GGCAGGOSCC ATCATCAGCA CCCGGCATTG 
CAACAGCCAG AACGGQGAGC GCTGTGrPGGC CGCCCTGGCT OGTCTTCGAGC GCACCGACTT 
CCTGTCTCCC ATGTGCATCG GTGAGGTGGC GCATGTCAGC GCGGAGATCA CCTACACCTC 
CAAGCACTCT GTOGAGGTGC AGGTCAACGT GATGTCCGAA AACATCCTCA CAGGTGCCAA 
AAAGCTGACC AATAAGGCCA CCCTGTQGTA TCTGCCXXTTG TCGCTGAAGA ATGTC3GACAA 
GGTCCTCGAG GTGCCTCCTG TTGTGTATTC CCGGCANGAG CAGGAGGAQG AGGGCCGGAA 
G0QC3TATCAA GCCCAGAAGC TGGAGCGCAT QGAGACCAAG TC3GAGGAACG GGGACATCGT 
CCAGCCAGTC CTCAACCCAG AGCCGAflCAC TGTCAGCTAC AGCCAGTCCA GCTTGftTCCA 660

CCTCGTQGGG CCTTCAGACT GCACCXTIGCA CXSGCTTTGTG CAOGGAGGTC TQACCATGAA 720

GCTCArGGAT GAGGTCGCXX5 GGATCGTOGC TGCACGCXAC TOCAAGACCA ACATCGTCAC 780 

AGCTTCCGTG GACGCCATTA ATTITCATGA CAAGATCAGA AAAQQCTGCG TCATCACXAT 840 

CTCGGGACGC ATGACCTTCA CGAGCAATAA GrrCCATGGAG ATCGAGGTGT TQGrTGGACGC 900

CGACCCTCTT GTGGACAGCT CTCAGAAGCG CTACCGGGCX: GCCAGTGCCT TCTTCADCTA 960 

CGTGTOGCTG AGCCAGGAAG GCAGGTOCSCT GCCTGTOCCC aVGCTQGTGC CCGAGACTGA 1020 

GGACGAGAAG AAG0C5CTTTC AGGAAGC3CAA AGQGCGGTAC CTGCAGATGA AGGCX3AAGCR 1080 

ACAGGGCCAC GCGGASCYTC AGCCCTAGAC TXXCTCCTCC TGCCACTGGT GCCTCGAGTA 1140 

GCCATGGCAA CGGGCCCAGT G?rCCAGTCAC ITAGAAGnTC CCCCCTTQGC CAAAAACCCA 1200

ATTCACATTG AGAGCTGGTG TTGTCTGAAG TrTTCGTATC ACAGTGTTAA CCrGfTACTCT 1260 

CTCCTGCAAA CCTACACACC AAAGCTTTAT TTATATCATT CCAGTATCAA TGCTACACAG 1320 

TGITOPCCCG AGCGCCGC3GA GGCtJrTGQQC AGAAACCCTC GGGAATGCTT CXX5AGCA0GC 1380 

TGTAGGGTAT GQC5AAGAACC CflGCACCACT AATAAAQCTG CTGCTTGGCT GGAAAAAAAA 1440 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500

AGAAAAAAN 1509 

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1316 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

AGCTGTATCA TAGGAAAGAT GGCCACACCG GCGGTAOCAG TAAGTGCTCC TCCGQCCACG 60 

CCAACCCCAG TCCCGGCGGC GOCCCCAGCC TCAGTTCCAG 0C3CCAACGCC AGCACOQGCT 120 

GCGGCTCCGG TTCCCGCTGC GGCTCCAGCC TGCATCCTCA GACCCTCXX3G CAGCAGCGGC 180 

TGCAACTGCG GCTCCTGGCC AGACCCOGGC CTCAGCGCAA NTCCAGCGCA GACCCCACpS 240 

CCCGCTCTGC CTGGTCCTGC TCTTCCAGGG CCCTTCCCCG GCGQCCGCGT QGTCAGQCTG 300

CACCCAGTCA TTTTGGCCTC CATTGTGGAC AGCTACGAGA GACGCAACGA GGGTGCTGCC 360 

CGAGTTATCG GGACCCTGTT GGGAACTGTC GACAAACACT CAGTQGAGGT CACCAATTGC 420 

•mTCAGTGC CGCACAATCA GTCAGAAGAT GAAGTGGCTG TTGACATOGA ATTTGCTAflG 480

AATATGTATG AACTGCATAA AAAAGTTTCT CCAAAIGAGC TCATCCTQGG CTGGTACGCT 540 

ACX3QGCCATG ACATCACAGA GCACTCTGTG CTGNATCCAT GAGTACTACA GCOGAGAGGC 600 

COCCAACCCC ATCCACCTCA CTGTOGACAC AAGTCTOCAG AACGGCX3GCA TCAGCATCAA 660 

AGCCTACGTC AGCACTTTAA TOGGAGTPCCC TQGGAGGACC ATGGGAGTTGA TGriTCMGCC 720 

TCTQACAGTG AAATACGCX3T ACTACGACAC TGAACGCATC GGAGrTTGACC TGATCAOGAA 780 

GAOCTGCTTT AGCCCCAACA GAGaXMTQG ACTCTCAAGT GACTTGCAGC AAGTAQGAQG 840 

GGCATCAGCT CGCATCCAGG ATQCCCTGAG TACACTGITC CAATAIGCAG AGGATGIIACT 900

GTCTQGAAAG GTGTCAGCTG ACAATACTGT GGGCCGCTTC CTGATCAGCC TQGTTAACCA 960 

AGTACCGAAA ATAGTTCCCG ATGACTTTGA GACCATQCTC AACAGCAACA TCAATGACCT 1020 

TITGATGGTC ACCTACCTGG CX:AACCTCAC ACAGTCACAG ATTGCACTCA ATGAAAAACT 1080 

TGTAAACCTG TGAATQGACC CCAAGCAGTA CACTTGCTOG TCTAGGTATT AACCCCAGGA 1140 

CTCAGAAGTG AAGGAGAAAT GGGTmTPG TGGTCTTGAG TCACACTGAG ATAGTrCAGTT 1200

GTGTGTGACT CTAATAAACG GAGCCTACCT TTTGTAAATT AAAAAAAAAA AAAAAAACCN 1260 

SCSRQGGQGGG CCXX3GTCCCA TTSSCCCTTT NGTAATTCC?r NTTACAATCC (XNGC5C 1316 

(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 777 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

GQCATGWKCA GACATGACTT CTATTGCCAG GCTGGrTCAAG TGGCAGGGTC ATGAGQGAGA 60 

CATCGATAAG GGTOCTCCTT ATOCTCCCTG CTCTGGAATC CACCAGCGGG CTATCTGOGT 120 

TTATGGGGCT GGGGftCTAGA ATTQGATGCT TCAAAACCAT CACCrGTTGG CCAACAACrTT 180 

TGACOCAAAG GTAGATGATA ATGCTCTTCA CTGCTTAGAA GAATACCTAC GTTATAAGGG 240

CCATTCTATT GGGACCTGAA CTTTGAAGAC CACAOTATTG AAGAGGOGTET GCTTACCYGT 300 

TQQGGGCCAA GAGGCATCJTT ACCAAACATG GYYCARGAAM YTTGGYKGGG AMCARKKKKG 360 

GKKGGGARRM CMRGGGYTTG SCAAWTTCSK KGCXM#CCYT TTAGGGTAAR RRGGGOOGnW 420 

AITAGATTGT GQGTAAAGTA GGATCTTTTG CCCTTGCAAA TTTGCTGCCT GGGTGAATGY 480 

TGCTTCSTTCC TTCTCMACCC CTAACCCTAG TAGnTCCTCC ACTAACTTTC TCACTAAGTG 540

AGAATGAGAA CTGCTC3TQAT AGGGAGAGTC AAQGAGGGAT AIGTOG?EAGA GCACT1X3ATT 600

TCACTTTGAAT GCCTCCTOGT AQCTTTTCCA rPCTGTQGAG CTGCCGTTCC TAATAATTCC 660 

AGGTTTQGTA GCGTOGAGGA GAACTTTGAT GGAAAGAGAA OCTTCCCTTC TOTACTGrrTA 720 

ACITAAAAAT AAATAGCICC TGATTCAAAG TAAAAAAAAA AAAAAAAAAA AAAAAAA 777 

(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 791 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
GGCACAGEATA ACTATCTACA TGTATTCCTT AAATC3TTTTT TTAACTTTTA TATTCTTGGC 60 
ACTGGTCTTC AAATGTGTAC ATC3TGTGCCA GC3GAGCAAAT GCCTTCTICT TTCTGAAATT 120
GGTCTTTTAG ACTGTTCTTT TTTCCCATCT TCTCACCTCC TQCCCCTCCT TCAGGGTACT 180 
TCCGTGGCCA GAACCCCTCC AGGTCAGAGG CAGAAGAGAA GCCTCATGQG TCACAGCAGC 240 

AGATGTGGGC TQGAGATCTA TTCATITQGT TTTGGCTTGA ATTTTCTGRA TGGTTTACTT 300 
GATCYTGGGA AAGANATATC TTGCCAGGAA AAATGATACa^ CCTTGACAAT GTTGAATGAT 360 
CCTGCACCAC COTGAAAGAC ATTTCTAATA TGGTTTGTCA GGCAAAGTGG TTAGTAGTCA 420
TTTGTGGCCT GAGGTAGAAG TCCTCAGAAA TCAGCAGACT TCACTGATAA AATGCTGACT 480 
TGCCCCTQGA CTGGGCTCTG TGAGAGTQGC CTTCTGCACT GTGCACAGTA QGTGTGAACA 540 

CACCACACCT ACAGGGACCA CGrTGGTGGGC TGTQGACTAG CGGCCAAGCT CCCTGCAGGC 600 
CCACTAATAG AATTCAGCTT TTAGCATGGG CTGTTTCATA CTGITCTGAT GAAACTGATT 660 
TGGTTTCTTT CCTCCATACC CCTTCTGCAT TTCAGTGTTT TPGTTTAGTT TTCCTGGrrTT 720
TTAATTATAA CTACAAAATA AAATCTTTAG GCTATTCACC TTAGCTTAGT AAAAAAAAAA 780 
AAAAAAAACT C 791



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1405 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

AAATTGATPA ACAGCTTGAA AGAAGtXTCT GGTTITCAAG GCCTAGATAG CAGCACTOCC 60 

AGTAGCOTGG AGCTGGAAGA ACTTCGGCAT GAGAAAGAGA TOCflGflOGGA GGAAATACAG 120 

AAGCTQATGG GCCAGATACA TCAGCTCAGA TCCGAATTAC AGC5ATATGGA GGCACAGCAA 180 

GTTAATGAAG CAGAATCAGC AAGAGAACAG TTACAGGWTC TGCATGACCA AATAGCTQQG 240

CAGAAAGCAT CCAAACAAGA ACTAGAGACA GAACTQGAGC GACTCAAGCA GGAGTTCCAC 300 

TATATAGAAG AAGATCTTTA TOGAACAAAG AACACATTGC AAAGCAGAAT TAAAGATCGA 360 

GACGAAGAAA TTCAAAAACT CAGGAATCAG CTTACCAATA AAACTTTAAG CAATAGCAGT 420 

CAGTCTGAGT TAGAAAATCG ACTCCATCAG CTAACAGAGA CTCTCATCCA GAAACAGACC 480 

ATGCTQGAGA GTCTCAGCAC AGAAAAGAAC TCCCTGGTCT TTCAACTQGA GCGCXTTOGAA 540

CAGCAGATGA ACTCCGCTTC TGGAAGTAGT AGTAATQGGT CTTCGATTAA TATGTCTGGA 600 

ATTGACAATG GTGAAGGCAC TCGTrCTQCXSA AAICTTCCTG TTCTTTTTAA TGACACAGAA 660 

ACTAATCTQG CAGGAATGTA CGGAAAAGTT CGCAAAGCTG CTAGTTCAAT TGATCAGTTT 720 

AGTATTCGCC TOGGAATTIT TOTCCGAAGA TACXXCATAG CGCGAGTTTT TGTAATTATA 780 

TATATGGCTT TQCTTCACCT CTGGGTCATG ATTGTTCTGT TGACTTACAC ACCAGAAATG 840

CACCACGACC AACCATATGG CAAATGAACC AAGCCCAGTT GTTGCAGTGA TTGGTTGTCT 900 

TITPCTAGAC TTGGGATCTG CAAGAAGGCC AATTGCCTAA AATTTCTGAG AACAGTQCAC 960 

AAGATTATTT TATCACTACA AGCTTTTAAC TTTTTAAGTT ATTCTACAAG TATTCTACCT 1020 

AAATCTTCCA ATTTCCITTA AATC3GTAAGA GTTTCTAAAA CAGACAATAA TTTAACAAGC 1080 

TCAGCTCTGC TTTATCTGAG TTTAGrTGGTC CTAATATATA TGTAGAGAAA GATGGTOGGG 1140

TTGrrrCACCT CTCTACAGAC CAOX^CTATG TTAGGTGACA TTGATTAaXSG GTTATAATCA 1200 

GGGAAACTAA TTGTAlTrAG TGACAAAAAT AAAAAGTTTT TTTTTTATAA TTCAGTCTQC 1260 

TTTTGGATTT TCATATATTT AACTTTGCAA AAAGATTTAC TTTGTACATG TTACAGQCTT 1320 

GATTQGTGTA AATCTTTTTA TAAATACATA AATAAAAGNA AAATATQCAT ' I T r i CX ' mXJ 1380 

TAAAAAAAAA AAAAAAAAAA CTCGA 1405



(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1596 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

CnCATPQCAGT GCGCCGGAGA ACTGrPGCTCT TTGAGGOCGA CGCTAGGGQC COSGAAGGGA 60

AACTGCGAGG CGAAGGTGAC CGGGGACOGA GCATTTCAGA TCTOCTCGGrr AGACXTTOGrPG 120 

CACCACCACC ATGTTGGCTG CAAOC5CTGGT GrPGTCTCaSG ACACTACCTT CTftGGGlTTT 180 

CX^CCCAGCT TPCACCAAGG CCTCCCCTGT TGTGAAGAAT TCCATCACGA AGAATCAATG 240 

GCTGTTAflCA CCTAGCASGG AATATGCCAC CAAAACAAGA ATTGOGATCC GGCXSTOGQAG 300 

AACTOQCCAA GAACTCAAAG AGCXZAGCATT GGAACCATCG ATGGAAAAAA TATTTAAAAT 360

TGATCAGATG QGAAGATOGT TTGTTGCTGG AGGGGCPGCT GrrTGGTCTTG GAGCATTGTG 420 

CTACTATGQC TTGGGACTGT CTAATGAGAT TGGAGCTATT GAAAAGGCTG TAATTTGGCC 480 

TCAGrrATGrrC AAGGATAGAA TrCAVrOCPiC CTATATGTAC TTAGCAGGGA CTATrQGTTT 540 

AACAGCTTTG TCTGCCATAG CAATCAGCAG AACGCCTGrTT CTCAHGAACT TCATGMGAG 600 

AGGCTCTTGG GTGACAATTG GTGTGACCTT TOCAGCCATC GTTGGAQCTG GAATQCTQGT 660

ACGATCAATA CCATATGACC AGAGCCCAGG CCCAAAGCAT CTTGCTTC3GT TGCTACATTC 720 

TGGTGTGATG GGTGCAGTGG TOGCTCCTCT GACAATATTA GGGGGTCCTC TTCTCATCAG 780 

AGCTQCATC3G TACACAGCTG GGATTGTGGG AGGCXTTCTCC ACTGnOGCCA TGTGTGCXXX 840 

CAGTGAAAAG TTTCTGAACA TQGGTOCACC CXrTGGGAGTG GGCCTQGGTC TCGTCTTTCT 900 

GTCCTCATTG GGATCTATOT TTCTTOCACC TACCACOGTG GC^IGCFTGCCA CTCTTTACTC 960

AGTGGCAATC TACGGTOGAT TAGTrCTTTT CAGCATOTTC CTTCTGrTATG ATACCCAGAA 1020 

AOTAATCAAG CGrTGCAGAAG TATCACCAAT OTATQGAGTT CAAAAATATG ATCCCATTAA 1080 

CTCGATGCTG AGTATCTACA TGGATACATT AAAIATATTT ATGCXaGTIG CAACTATGCT 1140 

GCXMCTGGA GGCAACAGAA AGAAATGAAG TGACTCflGCT TCTGGCTTCT CTOCTACATC 1200 

AAATATCTTG TITAATQGGG CAGKTATGCA TTAAATAGrTT TGrTACAAGCA GCTTTCGrTTG 1260

AAGTITAGAA GATAAGAAAC ATGTTCATCAT ATTTAAA3X3T TCCGGTAATG TGATGCCTCA 1320 

GGTCTGCXnT TTTITCIGGA GAATAAATGC AGTAATCCTC TCCXAAATAA GCACACACAT 1380 

TTTCAATTCT CAOCTTTGAG TGATTTTAAA ATGnTTQGT GAAICTGAAA ACTAAAGTTT 1440 

GrrGTCATGAG AATGTAAGTC TTTTrTCTAC ITTAAAATTT AGTAGGTTCA CTGAGTAACT 1500 

AAAATTTAGC AAAOCTGrTGrr TTGCATATTT TTTKGGAGTC CAGMMTASTO TAATTARAGC 1560

ATTCCAGrrAA NAGTOmTTT AAAGTTCNTC TATATN 1596 

(2) INFORMATION FOR SEQ ID NO: 185:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2293 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

GOGCflGAGCC CGYACGAGCA GGAOGAOGAG GACAAGGGCG ACTOCAAGGA AACGCQGCTG 60 

ACCCTGATGG AGGAAGTCCT CCTGCTGC3GC CTCAAGGAOC GCGARGGTTA CACATCATTT 120 

TQGAATGACT GTATATCATC TGGATTAOGT GQCTGTA3X3T TAATTGAATT AGCATTGAGA 180 

GGAAGGTTAC AACTAGAGGC TTGTOGAATG AGACGTAAAA GTCTATTAAC AAGAAAGGTA 240 

ATCTOTAAGT CAGATGCTCC AACAGGQGAT GrrTCTPCTTG ATGAAGCTCT GAAGCATGTT 300

AAGGAAACTC AGCCTCCAGA AACGGTCCAG AACTQGATTG AATTACTTAG TGGTGAGACA 360 

TGGAATCCAT TAAAATTQCA TTATCAGTTA AGAAATGTAC GGGAACGATT AGCTAAAAAC 420 

CTC3GTGGAAA AGGGTCTATT GACAACAGAG AAACAGAACT TCCTACTTTT TGACATGACA 480 

ACACATCCCC TCACCAATAA CAACATTAAG CAGCGCCTCA TCAAGAAAGT ACAGGAAGCC 540 

GTTCTTGACA AATQGGTGAA TGAOCCTCAC CGCATCX5ACA GGCGCTTGCT GGCCCTCATT 600

TACCTGGCTC ATGCCT0C3GA CGTCCTGGAG AATGCTTTTG CTCCTCTTCT GGACGAGCAG 660 

TATGATTTQG CTACCAAGAG AGTGCGGCAG CTTCTCGACT TAGftCCCTGA AGTGGAATGT 720 

CTGAAGGCCA ACACCAATGA GGTTCTGrTQG GCGGTGGTGG CGGOGTTCAC CAACTAACTC 780 

TGCTCGGGGT GAACCATTCT CCTTTCrCTC AACTAAACCA GTAGTTTTTC TTCTGTTGAC 840 

TTCTQGTTrr CrGTAATTTG TACTTTOCCA CACTATAATT GGCTTCTGTT TTACAAAATG 900

GTGGGTQGCT TTTTCTTTTT TGTACGTGTA CAGGATTCPG CTGGTACGAG AGGCCTTCCT 960 

CTTTCTGrrTT TTAAAAAAAG TTTTACTQCC ATATTGGCAT TCXATTOCCT GrTTGCCATCC 1020 

TCACTGTTAC CTGTTTTQGG TTTGTGGrPCT ACTTTGACTT TCAAAGTACC TCCAGCCTCC 1080 

TCATACGCAC AGCTTTTGGA TGACCTCAGC TTGAGTTTCT CCATATGTGC ATGTACATCT 1140 

AGCATTCTGC CTACAGTTCA GACAGAAGTC ACAAAAAGGC CTTCAACTCA CCAAAGGTAA 1200

ATATCTGTAT CTATTAGGAC ATmTTACA TAGACTTCAG TTGAGATGTA TACTTAGCAA 1260 

AATTATTTTT AAATTGAAAC AGCACAGTAA ATACTTAATA TAAAATGTCC CTTQGATTTT 1320 

GCTTCCCATG TAAATCTATT GTIATTATTAC ACTTCTTATA ATTTTAACTA TAAAGGTCCA 1380 

ATTGTTTCAC AGAGCCAGTIT TGGGATGGGC TGCATTOCAT TTATGCTGTA TATAGTTTGA 1440 

ATTATATATA AATTACCCCT TCTTCTGGCC ACCCCTGCTC CCATCTTACTT ATTTTGCAAG 1500

ATCTAATCAG TTCTACACCT GGTOCtXCTC GCTTQCTTCA ATCAITGCTrTA TTTGATC3GCA 1560

AAATOGflCCT CTTGrrCGCTG AAGGftGflGAG AAAAGATGTC TGTCTCATTC GTCCPGQGAT 1620 

TTTTTGAGCT GTCCCATTTA TGffEftCTCTT TGCCTATGCA aXXXXTTTTTT AGMTTTTIT 1680 

TAAATTTTAT CTTACTGTTT TTATAATTTC TATTC3GGAAG AC3GCTTGTGA OCAGTACXAA 1740 

TCTTGAGTTT CTTTTTCTOT (XaCAACTAA ATTAATATCT GCTCTGAAAT GTCATTTATC 1800

TACTCACACA TTCITGGQGA AAAAAATCAA ATGTCAGTCC TftGCAGATGT 1GCATGTAAA 1860 

TTOGTAGCAA GTAATGATTA CAADCX^GAG GATTAAGAAT TTICTAACAG AAAGCTCTAT 1920 

CTTTTAAITT TTTATATACA ATTAGGATAA TTAGCATroT CAGACTATAA ACCTTTQCTT 1980 

nTAAflGTTT ATTTTTACTA TTTCTTTATC ACTTTATTGT ATCATCACCA ITQGTTTCAT 2040 

AATGTAAATA CTATATGTTG AACAAATTAA ATGTCAAAAT TTTTTATTAC CATAGTCCAT 2100

GTTAATAGTG GGGCTTTCAG GTOTTTAGAG ATTTTTTTTG TTGTTGTTAA CATTCATTGC 2160 

AAAAGTACTA GATQGTGTAT AACTCTAGAG TTGAATTTTA AGGGATTCCC TAATATGTAT 2220 

ACTATCTTTT TATCTGAAGT AATAAATAAA CAATGATCTT GAAAGTGCCY RAAAMAAAAA 2280 

AAAAAAAAAA AAA 2293 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1212 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:
GGCACGAGGC GAGCXXSCXXSC ACCGTAOGCT GGGACGTGTG GrTTCAQCTC GTCCGCCrCC 60 
CXX3TQGGTTT GCGACGTTTA GCGACTATTG CGCCTGCGCC ACGCCGGCTG CX5AGACTGGG 120
GCCGTQGCTG CTGGTCCOGG GTGATGCTAG GCGGCTCXXrr GGGCTCCAGG CTGTTGCGGG 180 
GTGTAGGTQG GAGTCACGGA CXSGTTCGQGG CCCGAGGTGT CCOCGAAGGT GGCGCACATG 240 

GGCGGCAGGG GAGAGCATGG CTCAGCX3GAT GGTCTGGGTG GACCTGGAGA TGACAGGATT 300 
GGACAHTGAG AAGGACCAGA rEATTGAGAT GGCCTGTCTG ATAACTGACT CTGATCTCAA 360 
CATTTTOGCT GAAOGTCCTA ACCTCATTAT AAAACAACCA GATGAGrTTGC TGGACAGCAT 420
GTCAGAITGG TGTAAGGAGC ATCACGQGAA GTCTGGCCTT ACCAAQGCAG TGAAGGAGAG 480 
TACAATTACA TTGCAGCAGG CAGAGTATGA ATTTCTGrrCC rrTGTAOGAC AGCAGACTCC 540 

TCCAGGCXrrC TGTCCACTTG CAGGAAATTC AGTTCATCAA GftTAAGAflCTT TTCITCACAA
ATACATCXXr CAC?ITCATCA AACATCTTCA TTATAGAATA ATTGATOPGA GCACTGTTAA 
AGAACIOTGC AGACGCTOGT ATCCAGAAGA ATATGAATTT GCAOCAAAGA AQGCTTGCTTC 
TCATAGQGCA CTTGATGACA TTAGTGAAAG CATCAAAGAG CTTCAGTITr ACCX3AAATAA 
CATCTTCAAG AAAAAAATAG ATGAAAAGAA GAGGAAAATT ATAGAAAATC GGGAAAATOA 
GAAGACCGTG AGTTGATGCC AGTTATCATG CTGOCACTAC ATOGTTATCT GGftGGCAACT 
TCTGGTGGTT TTTTTTTCTC ACGCTGATGG CTTGGCAGAG CACCTTOGGT TAACTTGCAT 
CTCCAGATTG ATTACTCAAG CAGACAQCAC ACGAAATACT ATTTTTCTCC TAATATGCTC 
TTTCCATTAT GACACAQCAG CTCCnTGTA AGTAOCAGGT CATGTCCATC CXTTTOGTACA 
TATATGCATT TQCTrTTAAA CO^ITTCTTT TGTTTAAATA AATAAATAAG TAAATAAAGC 
TAGTTCTATT GAAATGCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AN 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
GCTTCCGGAA GTTQCTTrTG TCCAAACATC OGGGCTTCTC C m ' lWiVr TCCGGCCQAT 
CCCACCTCTC CTCGACOCTG GACGTrCTAOC TTCOQGAGGC 0CACA3CTTG CCCACTCCGC 
GCGCGGGGCT AGCGCGQGTT TCAGCGACGG GAGOCCTCAA GGGACATGGC AACTACAGCG 
GCGCCGQOQG GCGGCGOCCG AAATGGAGCT GGCCCGGAAT GGGGAGQC3TT CGAAGAAAAC 
ATCCflGGGCG GAGGCTCAGC TGTrGATTGAC ATQGAGAACA TGGATGATAC CTCAGGCTCT 
AGCTTCGAGG ATATQGGTGA GCTQCATCAG OGCCTQOGCG AGGAAGAAGT AGACGCIGAT 
GCAGCTGATC CAGCTGCTGC TGAAGAGGAG GATGGAGftGT TCCTGGGCAT GAAGGGCTTT 
AAGQGACAGC TGAGCCGGCA GGTGGCflGAT CAGATGTGGC AGGCTGGGAA AAGACAAGCC 
TOCAGGGCCT TCAGCTTGTA CGCCAACATC GACATCCTCA GACCCTACTT TGATGTOGAG 
CCTGCTCAGG TGCGAACAGG GCTCCTGGAG TCCATGATCC CTATCAAGAT GGTCAACTTC 
COCCAGAAAA TTGCAGGTGA ACTCTATQGA OCTCTCATGC TQGTCTTCAC TCTOGTTCSCr 
ATCCTACTCC ATOGGATGAA GAOGTrCTGAC ACTATTATCC GGGAGGGCAC CCTCATOGGC 
ACAGCCATPG GCACCTGCTT CGGCTACTGG CrGGGAGTCT CATCCTTCAT TTACTTCCTT
GCCTACCTGT GCAACGCCCA GATCACCATG CTGCAGMCT TQGCACTGCT GGGCTATQGC 
CTCTTTOGGC ATTGCATTGT OCICTTCATC ACXTEATAATA TXXACCTCCA CXXrCTCTTC 
TACCTCTTCT GGCTGTTGGT GGGTOGACTG TCCACACTQC GCATQGTAQC AGrTGTTGCTre 
TCTCGGACCG TGC3GCCCCAC ACAGCGGCTG CTCCTCTGTG GCAOCCTGGC TOCXXTACAC 
ATQCICTTCC TOCrCTATCT GCATTTTGCC TACCACAAAG TGGTAGAGQG GATCCTGGAC 
ACACTOGAGG GCCCCAACAT CCCGCCCATC CftGAGGGTrCC CCAGAQACAT OCCTCCCATG 
CTCCCTGCTG CTOCSGCTTCC CACCACCGTC CTCAACGCCA CAGCCAAAGC TGTTGCXSGrTG 
ACCCTGCAGT CACACTGACC CCACCTGAAA TTCTTQGCXA (TrCCTCTTTC CCOCMXTCQC 
AGAGAGGAGG AAGACTATTA AAGGACAGTC CTGAJPGACAT GTTTOGrTAGA TQGOGTTTOC 
AGCTGCCACT GAGCTOEAGC TGCGTAAGTA CXTTCCTTGAT GCNTCTCX3GC ACTTCTGAAA 
GGCACAAGGC CAAGAACTCC TGGCCAGGAC TGCAAGQCTC TGCAGCCAAT GCAGAAAATC 
C3GTCAGCTCC TTTCAGAACC CCTCCCCACC TACCCCTTCC TTOCTCTTTA TCTCTCCXaC 
ATTGTCTTGC TAAATATAGA CTTGGTAATT AAAATGTTGA TTQAAGTCTG GAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAG 

(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1516 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:
ATTOGGCATG AGGQQG?rCAC GTOGTOGCTG GGCCGGGGAA ATGGCGGCTT CAGGAGAGAG 
CGGGACTTCA GGCGGCGGAG GCAGCACCGA GGAAGCATTT ATGACCTTCT ACAGTGAGGT 
GAAACAAATA GAGAAGAGAG ACTCGGrTTCT AACTTCGAAA AATCAGATTG AAAGACTCAC 
COGTCCTGGT TOCTCTTACT TCAATTTGAA CCCATTTGAG GITCTTCAGA TAGATCCTCA 
AGITACAGAT GAAGAAATAA AAAAGAGGTT TCGGCAGTTA TCCATCTTGG TGCATCCTGA 
CAAAAATCAA GATGATGCTG ACAGAGCACA AAAGGCTTTT GAAGCTGrTGG ACAAAGCTTA 
CAAGTTQCTA CTGGATCAGG AGCAAAAGAA GAGGGCCCTG GATC3TAATTC AGGCAGGAAA 
AGAATAOGTG GAACACACTG TGAAAGAGCG AAAAAAACAA TTAAAGAAOG AAGGAAAACC 
TACAATTC3TA GAGGAQGATG ATCCTGAGCT GTTCAAfiCAA GCTOTATATA AACAGACAAT
GAAACTCTTT GCAGAGCTQG AAATTAAAAG GAAAGAGAGA GAAGCCAAAG AGftTGCATCA 
AAGGtfUUtf^GA CAAACXXtfUVS AASAGATT^

GTGGCAGAAA AACTTTQAOG AAAGTOGAGA TOGTCGTGTC GACAGCTGQC GAAACTTCCA 
AGCCAATACG AAGGGGAAGA AAGAGAAGAA AAATCQGACC TTCCTGAGAC CACCGAAACTT 
AAAAATQGAG CAACXTTGAGT GACCGCCCAA GGTCACAGGC ACflGAAOCTT TCCOCTOCTA 
TCTCCCTTCC TGCTTOSAAG GACTCATTCT TTCCTCCCAC ITCCACCCCA ACATAGACTA 
GTATTTCCTT TrTAGTCCAT TTTGTTTTCA ATAOGATTTA ATATCXMCA GflffTAAITCT 
TTTGTACATT GAAATGAGGG GCTTGGTTTA AAAAAAGACC TTTCCCTCOC CCTOCCXTTA 
GAACAAOCAG TATTAGAAGG TGCCACCATT GGTGCTQCXrT TCTCTTCCCA CAGCCTC5rAA 
CTCAGTGTTT TOTACTTCAC TGAATTGTGA TGGTTAGAAA CTTCGTGGAT AGTTIX3TX3GA 
AATCATCCAA TTAAACATAC TQCTTAAAAC AGTGTTGCTG TGACTTCA6A GACAAC3CCTC 
GAAGGGGCAC CTTAGGAAQC CCCTTCGCTT CAGTTGCTCG CTTCTGGGTG TGCTCCCTTC 
GAAGGCCCAG ATAAGACAQG GAACACTTGT GAQCACACAG AGCAGCATCT GATOXCTCT 
GGTOTTrGGC ATGTCCOCCC OXTPCTACTGA CCAATCAGTG TQGCATGAGG CCCACGCCAC 
CCAAACCTTT CACTTTCCAA AGAGCTAGCC GTCCTCXACC CAGTACCATG TCCTAGCCTC 
TCTGCATTTG TTAGTGGTAA TATTCTTTAT GTATAATAAA TTTTTATACC CAAAAAAAAA 
AAAAAAAAAA ACTOGA 

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 681 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:
GCTCCCATGrr TGCTGGCTGT CCGTACATCA CCCTCTOCCC TGCAGGAGGG GGCTACAGGC 
CATCTCCCTC CTGTAGGCCT CTGACTCCCC TCCACTTTTG GGCXXTICAGC TTATCTOGGG 
CAGGGGAOCA TTGCAGCATC CTCCCCTCCT CNGGACTCAA GGTGCTGAGG TATAAGCCCT 
C3GGCCCCAGA TOCCTGRTKA CACCTTCCTG GAGAAGACTC TCAAAAGTCA CrGTATATrr 
GAGTTCACCA GCAATAACTC CCCACACPCG AAGCAGGTCC AAADCCMAGG ATCCCAGGGrr 
CCTTGGGCTC TOTGGCACTG TCTTCCCAAG ATCCTTCCTG TTGCACAATG GGAAACCTAA
GftGGAAAAAG ACAGGQGCCT GCTTCCOCAG CCATOCGAGG GATTCCATGC CCACCTOOCX:
TCTGYCTGCC TCGCTGGAAT GTOGGCCCCT GCTCCCOSTC AOGnTTTQCT GrTCTCIGACC 
TATGrrrACA TCCOOGAGGG GTITCTOCCT (XTOCOCACC caggtcaggg totogtccag 
CaGCTTGCTG TGGQGTOCTC ACATGTGTCA CCACTC5CCCC OCTTGCCCXX: GC3GGGGGTCA 
TOGTCTCCTC CTGGATGCTG CTCCTTGAAT YTTmrrTT GAWAAACCYT TTAMAATEAA 
AAAAAAAAAA AAAAAACTCG A 

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1014 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 
GCCTCAAGCC AOGCATATGA TAATFTTCTG GAACATTCAA ATTCAGTGTT TCTACAGCCA 
GTTAGTCTAC AAACCATTCC AGCAGCACCA TCAAACCAGA GTCTGCCACT TTTTGTCATC 
GCTGGATGAT TGCTGGGCAA AQGTGGOCTT TTAGAGCTCT TAAAAGCCCA CAAAAAGGCT 
ATTCC3TAGAG CCACAGTCAA CACATTTC3GT TATATTQCAA ACSGCCATTGG CCTCATCATG 
TATTGGCTAC ACTTCTGAAC AACCTCAAAG TTCAAGAAAG GCAGAACAGA GTTTGTACCA 
CTGTAGCAAT AGCTATTGTT GCAGAAACAT GITCACOCTT TACAGTACTC CCTGCCTTAA 
TGAATGAATA CAGAGTTCCT GAACTGAATG TTCAAAATGG AGTGTTAAAA TCGCTTTCCT 
TCTTGTTTGA ATATATTGGT GAAATGGGAA AAGACTACAT TTATC3CCGTA ACACCGTTAC 
TTGAAGATGC TTTAATC3GAT AGAGACCTTG TACACAGACA GACGGCTAGT GCAGTGGTAC 
AGCACATGTC ACTTGQGGTT TATQGATTTG GTTGTQAAGA TTCQCTGAAT CACTTGTTGA 
ACTATGTATG GCCCAATtSTR TTTGAGACAT CTCCTCATC5T AATTCAGGCA GrTATGGGAG 
CCCTAGAGGG CCTGAGAGTT GCTATTGGAC CATGTAGAAT GTTGCAATAT TGTTTACAGG 
CJTCTGTTTCA CCCAGCCCGG AAACTCAGAG ATGTATATTG GAAAATTTAC AACTCCATCT 
ACATTQGrrrC CCAGGAOGCT CTCATAGCAC ATTACCCAAG AATCTACCAA OGATGATAAG 
RACACCTATA TTCGrTTATGA ACTTGACTAT ATCTTATAAT TrTATTGTTO ATTTKGTC3KT 
TAATGCACAS TACTTCACAC CTTAAACTTG CTTTGATTTG GTGATGTAAA CTTTTAAACA 
TTGCAGATCA GTGTAGGACT GGTCCATAGG GGAAGAGCTA GGAANTCCAT AGGC

(2) INFORMATION FOR SEQ ID NO: 191:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2779 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

TOOCAGCAGG OrreTGTCCAG ATOGTCAGTC TCTGGTGGCT AGCCTgrCCl' GACAGGQGAG 60 

AGnTAAGCTC CCGYTCTCCA CCGTGCOGGC TQGCCAGGTG GGCTGAGGGT GACCGAGAGA 120 

CCAGAACCTG CITGCTGGAG CITAGTGCTC AGAGCTQGGG AGGGAGGITC CGCOGCTCCT 180 

CTGCTGTCAG CGCOGGCAGC CCCTCXXX3GC TTCACTTCCT CCCGCAGCCC CTCCTACTCA 240

GAAGCTCOGG GATCCCAGCA GCCGCCACGC CCTGGCCTCA GCCTCCGGGG CTCCAGTCAG 300 

GCCAACACOG ACGOGCANTG GGAGGAAGAC AGGACCCTTG ACATCTCXAT CTGCACAGAG 360 

GTCCTGGCTG GACCGAGCAG CCTCCTCCTC CTAGGATGAC CTCACCCTOC AGCICTCCAG 420 

TTTTCAQCTT GGAGACATTA GATGGAGGCC AAGAAGATGG CTCTGAGQOG GACAGAGGAA 480 

AGCTQGATTT TQGGAGCGGG CTGCCTCCCA TQGAGTCACA GTTCCAGGGC GAGGACCGGA 540

AATTCGCCCC TCAGATAAGA GTCAACCTCA ACTACCGAAA GGGAACAGGT GCCAGTCAGC 600 

CQGATCCAAA CCGATTTGAC CGAGATOGGC TCTTCAATGC GGTCTCCCGG GGTCTCCCCG 660 

AGGATCTGGC TQGACTTCCA GAGTACCTGA GCAAGAOCAG CAAGTACCTC ACCGACTCGG 720 

AATACACAGA GGGCTCCACA GGTAAGACGT GCCTGATGAA GGCTGTGCTG AACCTTAAGG 780 

ACGGGGTCAA TGCCTQCATT CTGCCACTGC TGCAGATCGA CCGGGACTCT GGCAATCCTC 840

AGCCCCTGGT AAATGCCCAG TGCACAGATG ACTATTACCG AGGCCACAGC GCTCTGCACA 900 

TCGCCATTGA GAAGAGGAGW CTGCAGTGTC TGAAGCTCCT GGTGGAGAAT GGGGCCAATG 960 

TQCATGCCCG GGTCTQOGGC GCTTCTTCCA GAAGQGCCAA GGGACTTGCT TTTATTTCGG 1020 

TGAGCTACCC CTCTYTITGG CCGCTTQCAC CAAGCAGTGG GATGTGGTAA GCTACCTCCT 1080 

GGAGAAOCCA CACCAGCCCG CCAGCCTGCA QGCACTGACT CCCAQGQCAA CACAGTCCTC 1140

CATGCCCTAG TGATGATCTC GGACAACTCA GCTGAGAACA TTQCACTGGT GACCAGCATG 1200 

TATGATGGGC TCCTCCAAGC TGGGGCCCGC CTCTGCCCTA CCGTOCAGCT TGAGGACA3C 1260 

CGCAACCTQC AGGATCTCAC GCCTCTGAAG CTGGCCGCCA AGGAGGGCAA GATCGAGATT 1320 

TTCAGGCACA TCCTGCAGCG GGAGTTTTCA GGACTGAGCC ACCTTTCCOG AAACmCACC 1380 

GAGTGGTGCT ATGGGCCTGT COGGGTGTCG CTGTATGACC TGGCTrCTOT GGACAGCTCT 1440

GAGGAGAACT CAGTGCTQGA GATCATTGCC TTTCATTGCA AGAGCCCGCA CX3GACACCGA 1500

ATCSGTOGTTT TQGAGCCCCT GAACAAACTG CTGCAGGCGA AATGGGATCT GCTCATCCCC 1560

AAGTPCTTCT TAAACTTCCT GTGrEAATCTG ATCTACATCT TCATCTTCAC CGCTGTTGCC 1620 

TACCATCAGC CTACCCTGAA GAAGCAGGCC GCCOCTCACC TQAAAGOGGA GC3TTGGAAAC 1680 

TCCATOCTGC TGACGGGCCA CATCCT13VTC CTGCTAGGGG GGATCTACCT CCTCGTOGGC 1740

CAGCTOTOGT ACTTCTOQCG GOOOCACGTG TTCATCTOGA TCTOGTTCAT AGACAGCTAC 1800

TTTGAAATCC TCTrCCTGTT CXARGCCXTPG CTCACAGTOG TGTCCCARGT GCTGrTGTTlC 1860 

CTQGSCATCG AGrrGGrTACCT GCCCCTCCTT GTGTCTGCGC TQGTGCTQGG CTGQCTCAAC 1920 

CTQCTTEACT ATACACGTCG CTTCCAGCAC ACAGGGATCT ACAGTGTCAT GATOCAGAAG 1980 

COCTGGTGAG CCTQAGOCAG GANNTTQGCG CCOXAAGCT CCTACAGGCC CCAAOXSCCAC 2040

AGAGTCAGTC CAGCCCATC5G AGGGACAGGA KGAOSAKGGC AACGGGGCCC AGTACAGQQG 2100 

TATOCTQGAA GCCTCCTTTOG AGCTCTTCAA ATTCACCATC GGCATGGGCG AGCTGGOCTT 2160 

CCAGGARCAG CTGCACTTCX: GCGGCATGGT GCTOCTGCTG CTOCTOGSCT ACGTGCTOCT 2220 

CACCTACATC CTGCTOCTCA ACATC3CTCAT CGCCCTCATG AGCGAGACCG TCAACAGTOT 2280 

CGCCACTGAC AGCTGGAGCA TCTGGAAGCt GCAGAAAGCC ATCTCTGTCC TGGAGATQGA 2340

GAATGGCTAT TQGTGGTOCA GGAAGAAGCA GCGGGCAGGT GTCATGCTGA CCGrTTGGCAC 2400 

TAAGCCAGAT GGCAGCCCSG ATGAGCGCTG GTGCTTCAQG GTGGAGGAGG TGAACTGtSGC 2460 

nCATCGGAG CAGACGCTGC CTACX3CTGTG TGAGGAOCCG TCAGGGGCAG GTGTCCCTCX; 2520 

AACTCTCGAG AACCCTGTCC TGGCTTCCCC TCCCAAGGAG GA3X3AGGATG GTGCCTCTGA 2580 

GGAAAACTAT GTGCCOGTCC AGCTCCTCCA GTCCAACTGA TGQCCCAGAT GCAGCAGGAG 2640

GCCAGAGGAC AGAGCAGAGG ATCTTTCCAA CCACATCTGC TQGCTCTGGG GTCXXAOPGA 2700 

ATTCrGGTOG CAAATATATA TTTTCACTAA CTCAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

AAAAAAAAAA AAAAAAGGC 2779 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LaJGTH: 1923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANTTFTTWESS : dotible 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

AOCCGCTCCG CTCCGCTCCG CTOGGCOCCX3 CXXX3GC00GT GAACATGATC OC5CTC3COGCC 60
TGQCCTQCGA G0C3CTGCCGC TGGATCCTOC CCCTQCTCCT ACTCAGCX3CC ATCGCCITCG 120
ACATCATCCjC GCTQGCOQGC OQCGGCTGGT TGCAGTCTAG C3GAOCACGGC CAGAOGTOCT 180
CGCTGrPGGTC GAAATGCTCC CAAGAGGGCXS GCGGCAGCGG GTCCTACGAG GAGGGCTGTC 240
AGAGOCTCAT GGAGTAOGCG TGGGGTAGAG CAGOQGCTGC CATGCTCTTC TGTGGCTTCA 300
TCATCCTGGT GATCTGTTTC ATCCTCTCCT TCTTCGCXXrr CTGTQGACCC CfiCMKXTVG 360
TCTTCCTGAG AGrrGftrTTGGA GGTCTCCTTG CCrTGGCTGC TGTGnTCCAG ATCATCTCOC 420
TGGTAATTTA CCCOGTGAAG TACACOCAGA CCTTCACCCT TCATGCXMC CSTQCTOTCA 480
CTTACATCTA TAACTGGGCC TACGGCTTTG GGTOGGCAGC CACGATTATC CTGATYGGCT 540
GTOCCTTCTT CTTCTQCTGC CTCCCCAACT AOGAAGATGA CCTTCTGGGC AATGCXMGC 600
CCAGGTACTT CTACACATCT GCCTAACTTG GGAATOAATG TGGGAGAAAA TCGCTQCTQC 660
TGAGATOGAC TCCAGAflGAA GAAACTGrTTT CTCCAGGCGA CTTTGAACCC ATTmTOGC 720
AGTOTTCATA TTATTAAACT AGTCAAAAAT GCTAAAATAA TTTQGGAGAA AATATTmT 780
AAGTAGTGTT ATAGTTTCAT GlTTATCnT TATTATGTTT TGTGAAGTTG TOTCTTTTCA 840
CTAATTACCT ATACTATGCC AATATTTCCT TATATCTATC CATAACATTT ATACTACATT 900
TCTAAGAGAA TATOCACCTG AAACTTAACA CTTTATAAGG TAAAAATGAG OTTTCCAAGA 960
TTTAATAATC TGA3CAAGTT CTTGTTATTT (XAAATAGAA TGGACTOGGT CTTGTTAAGGG 1020
CTAAOGAGAA GAGGAAGATA AOGTTAAAAG TTGTTAArcA CCAAACATTC TAAAfiGAAAT 1080
GCAAAAAAAA AGTTTATTTT CAAGCCTTCG AACTATTTAA GGAAAGCAAA ATCATTTCCT 1140
AAATOCATAT CATTTGTCAG AATTTCTCAT TAATATCCTG AATCAITCAT TTCAGCTAAG 1200
GCTTCATGTr GACTCGATAT GTCATCTAGG AAAGTACTAT TTCAaraGTCC AAACCTOrTG 1260
CCATAGTTGG TAAGGCTTTC CTTTAAGTGT GAAATATTTA GATGAAATTT TCTCTTTTAA 1320
AGTTCTTTAT AGGGrTTAGGG TGTGGGAAAA TQCTATATTA ATAAA3CTGT AGTGrrTTTGT 1380
GTTTATATGT TCAGAACCAG AGTAGACTGG ATTGAAAGAT QGACTGGGTC TAATTTATCA 1440
TGACTGATAG ATCTCGTTAA GTTGTGTAGT AAAGCATTAG GAGGGrTCATT CTTGTCACAA 1500
AAGTGCX^iCT AAAACAGCCT CAGGAGAATA AATGACTTGC TTTTCTAAAT CrCAGGTTTA 1560
TCTQGGCTCT ATCATATAGA CAGGCTTCTG ATAGrrTTGCA ACTCTAAGCA GAAACCTACA 1620
TATAGTTAAA ATCCTOGrCT TTCTPGGTAA ACAGATTTTA AATOTCTGAT ATAAAACATG 1680
CCACAGGAGA ATTCGGGGAT TTGAGTITCT CTGAATAGCA TATATATGAT GCATCGGATA 1740
GGrTCATTATG ATTTTTTACC ATTTCGACTT ACATAATGAA AACCAATTCA TrTTAAATAT 1800
CAGATTATTA TmCTAAGT TGTGGAAAAA GCTAATTGTA GmTCATTA TGAAGTTTTC 1860
CCAATAAACC AGGTATTCTA AAAAAAAAAA AAAAAAACTN GAGQGGQGGC CTGGTACCCA 1920
ATT 1923



(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

AGGCTCAGGG GGACACTCTC AAAATTACAC AGCTTTTAAC AGGTGGCAGA ATTGGGGTTC 60

AGACCCAGAT CTGGGTTCAA GTCACTCATG GTGrrGATTGC QGCATTCCTT 0CXX3CATCTC 120 

GQCCTTGCCA TCTCTCTCTC CGAGTQGACA TGGAGAGGAC GGGGGCCCAG CAGCTQGATG 180 

GCTC5CAGQG6 ATCAAGTCTT CTCTGGGGCT GGGCACGTAN AAGAGCATGT GGCTGGTGGA 240 

CGGCATGCCT GGCTCCTCAC CTQQCAGTCT GCCTGCCCTC CTAACCGGCT GrcTCTTCTr 300 

CCCCTAGTOC CCTOQQCTAG CATGACCXX3C CTGATGOGWT SCXX3CACAGC CTCTGGTTCC 360

AGCGTCATTC TCTGGATQGC ACCCGCAGCC GCTCCCACAC CAGCXSAGGGC ACCOGAAGCC 420 

GCTCCCACAC CAGCGAGQGC ACCCGCAGCC GCTCGCACAC CAGOGAGGGG GCCCACCTGG 480 

ACATCACCCC CAACTCGGGT GCTQCTGGGA ACAGNGOCGG GCCCAAISTCC ATGGAGGTCT 540 

CCTGCTAGGC GGCCTGOCCA GCTGCCGCCC COGGACTCTG ATCTOTGTAG TGGCCCCCTC 600 

CTCCOCGGCC CCTTTTCGCC CCCTGCCTGC CATACTGCGC CTAACTOGGT ATTAATCCAA 660

AGCTTATTTT GTAAGAGTGA GCTCTOGrGG AGACAAATGA GGTCTATTAC GTGGGTOCCC 720 

TCTCCAAAGG CGGGGTGGCG GTQGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC 780 

CTCTTCCATT AAOCAGTCGC CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT 840 

GCCAAAAACA AGACGCGTAC AGCACACACT TCACAAAGCC AAGCCTAGGC CGCCCTGAGC 900 

ATCCTGGnrC AAACOOGTCC CTGGTPCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT 960 

AACTGAGGAG AAGCTGATCC AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG 1020 

GGTAATAGTG ACATGCAGGC ACCTCTTTTA AACAGGCAAA ACAGGAAGQG GGAAAAGGTG 1080 

GGATTCATGT OGAGGCTAGA GGCATTTGGA ACAACAAATC TACGTAGTTA ACTTGAAGAA 1140 

ACCGA'iTlTr AAAGTTGGTG CATCTAGAAA GCnTGAATG CAGAAGCAAA CAAGCTTGAT 1200 

TTTTCTAGCA TOCTCTTAAT GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG 1260

ACAAAAATAT TTCAQCAAAC GTTGGGCATC ATGGnTTTG AAGGCTTTAG TTCTCCTITC 1320

TGCCTCTCCT CXIACAQCCCC AACCTCCCAC CCCTGATACA TGAGCCAGTC ATTATTCTIG 1380 

TTCAGGGAGA AGATCATTTA GAnTCTTTT GCATTCCTTA GAATOGAGGG CAACATTCCA 1440 

CAGCTC3CCCT GGCTGTGATG AGTGTCCTTG CAGQGGCCGG AGTAGGAGCA CTGQGC?K3GG 1500 

GGCGGAATTG GGGTTACTCG ATGTAAGQQA TrCCTTGrrTC TTGTGrPTGAG ATCCAGTOCA 1560 

GrrorGATTT CTGTOGATCC CAGCTTGCTT CCAGGAATTT TCTGrGATro GCTTAAATCC 1620 

ACTTTTCAAT CTTCGACAGC TGQGCTGGAA CCTTGAACTCA GTAGCTGAAC CTCTCTCACX: 1680 

CGGTCAOGTT CTTGGATCCT CAGAACTCTT TQCTCTTCrPC GGGGTOQGGG TOGGAACTCA 1740 

CGTGGGGAGC GGTQGCTGAG AAAATCTAAG GATTCTGGAA TACATATTCC ATCGGACTTT 1800 

CCTTCCCPCT CCTGCTTCCT CTTTTCCTGC TCCCTAACCT TTCGCCGAAT GGGGCAGCAC 1860 

CACTGAC3GTT TCTGGGCGCSC CAGTGCGGCT GCCAGGTTCC TGTACTACTC CXnTOTACTT 1920 

TTCATTITOG CTCACCC3TQG ATTTTCTCAT AGGAAGTITG GTCAGAGTCA ATTGAATATT 1980 

GTAAGTCAGC CACTQGGACC CGAGGATTTC TGGGACCCCG CAGTTGGGAG GAGGAAGTAG 2040 

TCCAGCTTTC CAGGTCGCGT GAGAGGCAAT GACTCGTTAC CTGCCXXXXIA TCACCITOGA 2100 

GGCCTTCXCT GGCXTTGAGT AGAAAAGTCX5 GGGATCOGGG CAAGAGAGGC TGAGTACGGA 2160 

TGGGAAACTA TTGriGCACAA GTCTTTCXAG AGGAGTTTCT TAATGAGATA TTTCTATTTA 2220 

TTTCCAGAOC AATAAATTTG TAACTTTQCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAACT CGAGGQGGGC COGTACCCAA TTCGCCGTAT AT6ATCGTAA 2340 

ACAATC 2346

(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3054 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:
TATCTGAACC ACCCTTTATT CTACATATGA TAGGCAGCAC TGAAATATCC TAACCCCCTA 
AGCTCMAGGT GCCCTGTQQJ ACGAGCAACT GGACTATAGC AGGGCTGGGC TCTCTCTrcC 
TGC5TCATAGG CTCACTCTTT CCCOCAAATC TTCCTCTGGA GCITTGCAGC CAAGGTGCTA 
AAAGGAATAG GTAGGAGACC TCTTCTATCT AATCCTTAAA AGCATAATCT TCAACATTCA
TTCAACAGCT GATGOCCTAT AACCCCTGCX: TQGATTTCTT CCTATTAGGC TATAAGAACT 300

AQCAAGATCT TTACATAATT CAGflGTGGlT TCATTGCXnT OCTACCCTCT CTAAOXaGCCC 360 

CrcCATITAT TTGACTAAAG CATCACACAG TOGCACTACX: ATTATACCAA GAGTATGAGA 420 

AAXACACTTQC TTTATGQCTC TAACATTACT GCCTTCAGTA TCAAGGCTOC CTCGAGAAAG 480 

GATGGCAGCC TCAGGGCTTC CTTAOXnCCT CCACCACAAG AGCTCCTTGA TGAAGGTCAT 540 

CTTTTTCCCC TATCCTOTTC TTCCXXTCCC OGCTCCTAAT GGTAOGTGGG TACCCAGGCT 600 

GGTTCTTGGG CTAGGTAGTG GGGACCAAGT TCATTACCTC CCTATCAGTT CTAGCATAGT 660 

AAACTAOSGT AOCAGTCTTA GTGGGAAGAG CTQGGTTTTC CTAGTATACC CACTQCATCC 720

TACTOCTACC TGGTCAACCC GCTGCTTCCA GGTATQQGAC CTGCTAAGTC TOGAATTACC 780 

TGATAAGGGA GAGQGAAATA CAAGGAGGGC CTCTGGTGTT CCTQGCCTCA GCCAGCTGCC 840 

CACAAGCCAT AAAOCAATAA AACAAGAATA CTGAGTCAGT TTTTTATCTG GGTTCTCTTC 900 

ATTCCCACTG CACTTQGTQC TQCTTTGGCT GACTQGGAAC ACCXX^VTAAC TACAGAGTCT 960 

GACAGGAAGA CTOGAGACTG*TCCACTTCTA GCTCGGAACT TACTGTGTAA ATAAACTTTC 1020

AGAACPQCTA CCATGAAGTG AAAATGCCAC ATTTTGCTTT ATAATTTCTA CCCATCTTOG 1080 

GAAAAACTGG CTTTTTCCCA GOCCTTTCCA GQGCATAAAA CTCAACCCXTT TCGATAGCAA 1140 

GTCCCATCAG CCTATTATTT TTITAAAGAA AACTTGCACT TGTTTTTCTT TTTACAGrTTA 1200 

CTTCCTTCCT GCCCCAAAAT TATAAACTCT AAGTGTAAAA AAAAGTCTTA ACAACAGCTT 1260 

CrrGCTTGTA AAAATATGTA TTATACATCT GTATTTTTAA ATTCTGCTCC TCAAAAATGA 1320

CTGrrcCCATT CTCCACTCAC TGCATTTGGG GCCTTTCCCA TTGGTCTQCA TGTCTTTTAT 1380 

CATTGCAGGC CAGTQGACAG AGQGAGAAGG GAGAACAGGG GTCGCCAACA CTTGrrGTTQC 1440 

TTTCTGACTG ATCCTGAACA AGAAAGAGTA ACACTGflGGC GCTCGCTCCC ATCCACAACT 1500 

CTCCAAAACA CTTATCCTCC TGCAAGAGTC GQCTTTCCAG GGTCTTTACT GG6AAGCACT 1560 

TAAGCOOCCT CXTCACCCCT TCCmTrrC TTTCITTACT CXTTTTOGCTT CAAAGGATIT 1620

TOGAAAAGAA ACAATATGCT TTACACTCAT TTTCAATTTC TAAATTTGCA OOGGATACTG 1680 

AAAAATACGG CAGGTOQCCrr AAGQCTGCTG TAAAGTTGAG GQGAGAGGAA ATCTTAAGftT 1740 

TACAAGATAA AAAACGAATC CCOTAAACAA AAAGAACAAT AGAACTGGTC TTCCATTTTG 1800 

CCACCTTTCC TGTTCATQAC AGCTACTAAC CTGGAGACAG TAACATTTCA TTAACCAAAG 1860 

AAAGTQGGTC ACCIGACCTC TGAAGAGCTG AGTACTCAGG CCAjCTCCAAT CACXCTACAA 1920

GATGCCAAGG AGGTCCCAGG AAGTCCAGCT CCTTAAACTG ACGCTAGNCA ATAAACCTGG 1980 

GCAAGTGAGG CTU^GAGAAAT GAOGAAGAAT CCATCTGTGA GGTCACADGC AAGGATGAAA 2040 

GACAAAGAAG GAAAAGAGTA TCAAAOOCAG AAAOGAGATC ATTTftGTTGG GTCTGAAAGG 2100

AAAAGrrCTTT GCTATCOGAC ATGrEACTQCT AGTACdCTA AGCATTTTAG GTCCXrAGAAT 2160 

GGAAAAAAAA ATCAGCTATT GGTAATATAA •tWVPGTCCTT TCCCTGGAGT CA bWlTlTr 2220 

AAAAAGTTAA CTCTTAGTTT TTACTTGTTT AATTCTAAAA GAGftAGGGAG CTCAGGCCAT 2280 

TCCCTGTEAGG AGTAAAGATA AAAGGATAGG AAAAGATTCA AAGCTCTAAT AGAGTCACAG 2340 

CTTTOCCAGG TATAAAACCT AAAATTAAGA AGTACAATAA GCAGAGGTOG AAAAJPGAIICT 2400 

AGrrrocTGAT AGCTACOCAC AGAGCAAGTG ATTTATAAAT TTGAAATCCA AACTACnrC 2460 

TTAATATCAC TTTOGTCTCC ATTTTTCCCA GGACAGGAAA TATGTCCCCC CXTTAACTTTC 2520 

TTGCTTCAAA AATTAAAATC CAGCATCCCA AGATCATTCT ACAAGTAATT TTGCACAGAC 2580 

ATCTCCTCAC CCCAGTOCCT GrTCTGGAGCT CACCCAAGGT CACCAAACAA Cn X JU l'LXjfi Xi 2640 

AACCNAACTG CCTTAACXnT CTC3GQQGAGG GGQATTAC3CT AGACTAGGAG ACCAGAAGTG 2700 

AATGGGAAAG GGPGAGGACT TCACAATGTT GGCCTGTCAG AGCTTGATTA GAAGCCAAGA 2760 

CAGTQGCAGC AAAGGAAGAC TTQGCCCAGG AAAAACXTTCT GGGTTGTGCT AATTrCrGTC 2820 

CAGAAAATAG GGTOGACAGA AGCTTGTOGG GTGCAIGGAG GAATTGGGAC CTGGTTATCT 2880 

TGTTATTCTC GGACTGrTGAA TTrTOGTCAT GTAAAACAGA ATATTCTGTA AACCTAATGT 2940 

CTGTATAAAT AATGAGCGriT AACACAGTAA AATATTCAAT AAGAAGTCAA AAAAAAAAAA 3000 

AAAAAACTCG AGGGQGGQCC CGGTACCCAA TTTNCCAAAT AGAGATOGTA TTAC 3054

(2) INFORMATION FOR SEQ ID NO: 195:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 907 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:

GGCAGAGCTC GTQGCCGNAA CTTTTTCTGC TCCTQGCTGC CACCTACTGG CTQGCCGCGG 60 

CCCTQGCCTG GGCCTGCACC AGCCTGOGMG CGGGCTOCCA CAGCAGCCCC CTTCCAAGCA 120 

GCGTCCCCAC ACCGCGCACC TTCTGCGGGA ACGTGCTCGC CGTGCCGGGG ACCATATQGA 180 

CGGAAGGCTT TGTGCTCACC TACAAGCTGG GTGAGCAGGG TGCCAGCAGC CTGTTGATCC 240 

TCTTGGCTCC TGCTGGAGCA CGAGCGGCGT TTCTGCTCCC GAGTTGGGAC TOTGGAATQG 300 

TGTOGGTGCT GTGC?rCTGCT CCATCGCTQG CTCCTCCCTG GGTGQGACCT TGCTGGCCAA 360 

GCACTQGAAA CTGCTGCCTC TGTGAGGTOG GTGCTGOGCT TCCGCCTCGG QGGCCTAGCC 420 



TGTCAGACTG CCTTGGTCTT CCJVCCTTGGA CACCCTGQGG GCCAGCATOG ACGCTOGCAC 480 

AATCTTGAGA GGGTCAGCCT TGCTGAOCCT AIX3TCTGCAG CACTTCTTGG GARGCCTGGT 540 

CACCACAGTC ACCITCACTG GGAATCATGC GCTGCAGCCA GCTGQCXCCC AGGGCCTTQC 600 

AGGCCACACA CTACAGCCTT CTGGCCACGC TGGflGCTGCT GQGGAAGCTG CTGCTQGGCA 660 

CTYTGGSCGG AGtSGCXTOGC TGATGGGTTG GC3GCCACATC CCltX: rii:X ' r GCTCCTGCTC 720

ATCCrCTCTG CCTTTCCCGT TCTGTACCTG GACCTAGCAC CTMCACCTT TCTCTGAGCT 780 

GAGTGGCTGG AGTGGTCAAT AAAGCCACAT GTOCCTGTOG CCCMAPAMi AAAAAAAAAA 840 

AAAAAAAAAA AAAAAAACPG GAGGGGGGC3C CCGGTACCCA AATCGCOXA TATGATOGTA 900 

AACAATC 907

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1290 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:
GGCAOGAGGA GQGACAGGGA GTQGGCAAGG GGAAGAAGCA GCTTATTTGA CTAACCAGCC 60 
CCTCTGTGGT CCAOCAQCGT CITOGCTrGG TGGGAGGGCT CTCAATCAGC AGQGCCCCAG 120
KflGGCSCAAGA AGAAGTGC3GG CAAAGCCTGG OGCTCGGCCG CGGTCGOGGC AGCTTTGCMA 180
TCTGGAGCCA CGCCTCCTCC AGQCCATGCT OCTTGAACTT GGAAATGTCA ACCGGAGCCC 240
TTAACACCAG CCCTCCAGCA TCTAATAGAC TTGAATCTAC TCTAAACGAA TATTTAATCC 300
AACCTCAACT ACATTGTAGC TCAGTCCAAC GACTAACCCT GAAATQGGGG TGTTCCAGCC 360
TTCAGCGAGA TQGCCAAGCG CTTCCCCTOGG GGCTGTOGCA GCXSGGCTTAT CCT'lCIVitrr 420
TGCCAACCTT GCCGTCCGAC CTCCTCCGCC COCATGCGCTr GACCCCGTCC GTC?rCTGTGrr 480
CTGTCCATAC GTGTGAGTCC AGCTAAAAAG ACAAAACAGA ACCCGTOGGC CCAGCTOGGA 540
AGGTGCGTGG AGAAGGCTCC GACGTCTCCG AAGTGCAGCC CTTGGGATQG CATTCCGTTG 600
TGTGCCTTAT TCCTGGAGAA TCTOEATACG QCTOGCCTAT AAGAAATATA GCCTCTTCAT 660
GCTGTATTAA AAGGACTTTT AAAAGCAAAA AAAAAAAAAA AAAAACTCGA GGGGGGGCCC 720
GGTACCCAAT TCGCCCAATA GTGAGTCGTA TTACAATTCA CTGGGCCGTC STTTTAACAA 780
CGTCGTC5AAC TGGGAAAACC CTGGCGTTTA CCCAACTTAA TOGCCTTGCA GCACATCCOC 840
CTTTCGCCAG CTQGCXJTTAA TAGOGAAAAA NC3CXXX3CACC CX5AATCGCCC TTCCCAACAG 900

TTTOCXXAGC CCTGAATOGC GAAATQGCAA ATTGTAAGCG TTTAATATTT TKKTTAAAAT 960 

TocNOGnrwA Awi-irritrrr taaatcarct cmttttttt aacccaataa gsccgaaatc 1020

CGGCAAATCC CCYTTATTAA TTCCAAAAAA ATAAACCSAA AAWGGGrTTTG AATTTTTTKT 1080 

TTCCCCAyrr TTQGAAACAA ASWYCCCCCT TTTTAAAAAA GTTQGAACCC CCAMCCYTCC 1140 

AAAGGGGAAA AAACSYTTTT YTQGGGQGNA Af«3GC3GOCCC CWTACTTTOA ACAYCCCCCC 1200 

CCAAWCAATT TITTTGGQGG GTCCCNAAAG GTCXXXCTAA AA NCTlTlTr OQGAACCCNA 1260 

AGGGGANCCC CCCATTTAAA ATnTOGCTTN 1290



(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1020 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

GGrrCTGCCTG GATQGTCGTG TADGTGAGTT TTACCAAGGA TTATGGTAAC AAATGAGTGA 60

GACCTCTATG GAGAAAATAT TGAAGNNCAT TAAAGAAGAC CTCATANTAG GAGAGAATGT 120 

SCTTTGGAGG ATTTGTATTG AGCTTTTACA CTrATTCATTT TTCAACTCAA GGCAATOOCT 180 

TTCTACACCA ACTCTAATCC ATAAACGGGT CTTATGACAT CTATGAAGTA GTAGCAAGAC 240 

ATGCTTAGTG TGTATTTCTC TCTTTGAGAC ACTGTAATTT CTAOCAGAAA TTTCCAGAGC 300 

ATTATCSTAGG TAGAAAAAAA TQCAAGCAAG CTGTTAAAGA TCTTOGATCC CATTATATAG 360

TATGTATAGC TGAAATCTGT AATTCAATCA CTTTTTCTCT TTTATCCTCT AACCAAAAAA 420 

TTGTTTAATT TTGCATCCCA AATGnTTTTA ATCTTTGrrAT ATTTTrTAAA AAYCCTTTTC 480 

TCCTCATCAT TGCCTTTTTT GTGGTTGTAA ATAGACTTAC TTGCACTTTG AAGATGAGTT 540 

ACTCCTTGTC ATCTTACAAA TATGTGATAT GGTAATTTTC ATAACAGATG TCAGrTTTTGA 600 

ACCAAGAATT GGTGATTTGT TTATAAGAAA AAAACTGGCT TCATTTCTGT GAAATTGCTC 660

TTTGAAAATT TCTTTTTACA CC?rGTAAGCC AACTGAGATA CCGTGATGGrT GTTGATTTCT 720 

TTCAATGATG CTTACCATCT ATTTTAGCCA CTGAGCCTTT TATTATTTGfT CTATTTGTAA 780 

AGTTTArrTG TCTTAACTCA TTTAATAAAT ATACTGrTTTA TCICTTTCTG AATGGGGACT 840 

GAACTITTTG GATATTGATA TTGATTTGAA AATATTTTGG AATTTTTTCT ACTTGAAATT 900 

TTAGAAATCT AATKGAAAAT TCTATAATGT ACTGAAAGTA VCCnTCTGTA CAGTGAKCAC 960

TCTCTAATAA TATGAraiCT TGCCCTAAAN GAGCSNGGGAC ATGTCCCACT TTCCACCACG 1020



(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

AATTCCXXAA GCTGAGGGTT GTGTGCCNrC GGGCGAGCCA AGTCTTTPGA CCGGACCCTT 60 

CCCGGCGCAG AAGANCTGAA GrTTGRTTTGA GAGCCTGTKT TTGOGGITRA GCCGAGCTGC 120 

TGCGOGCTTY GTOGCCGGCC AQGACftCAAG YTACTTQCAA OtSGQGOGGCG CCTGGCTTAT 180 

GATGTTCCTC AACCCAGQGG CQGCCTCTGC CCTCTACTCG TOXAGQCCC ACTTGCXMG 240 

CAGGAGCCCT CCCCAAGCCT TCAQGGCTQC TCGGAGTCAC CPGTTGGAAT GGACTAAAAG 300

GACCXTTTOTG TGGGAACAGG TGCTCCAAAC ACXXTOCTOC TOC3CTG0CAG GCAGGCXXTPC 360 

TGGAAOXSAA GGGGCAGGftC TCATCAQGAC CTCCCTQGAC CCTGCAGGGC AQGCAGTTGG 420 

CCCGAGCCCA AGCATTTGGC TCTGCTTQCC CXaU^GGGGAC AGGAAGCXTTC TTGQGCCTCT 480 

TccxrrrccTG gacaagqccx: CCTGCCTTTC CCTCACATAA ACTG 524

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:
GTGATACAAG GAAGQGTGAT CATCATCTGT CACCATGCAA TTCCTGCTCA CAGCCTTTCT 60 
GTTGGTGCCA CTTCPGQCTC TTICTGATGT CCCCATATCC CTAGGCTTCT CCCCCTCCTA 120
GAAGGGCTTC TTGATAGA1T AGAAAATAAG AATQAGTGAC ATTTCCTATG TGCATATAAG 180 
AAGGAGCCAC AAGACATGTC TTTTAAATAA AAGGACAGTG TCCATCCTTT TAGCTGCCGA 240 
ATAGAACCTT OGTCTCATCC TCCTQGAGCT AGG9CTTAAA ACAGCTTCTG TGTTTCTSAT 300 
TiOSTCTCART CTTTTTGCCAA GGTTTTATTC GG 332

(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 376 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

CCAGGC3AAGC CCCARGCCTG TOCTGAATTG ACATCAGTGC TTCCCTGAAC TC3CCTCCCCC 60 

ACCCCTGGGC ATTATCCCAG GAAACTTATG TTTTCTAGAA GCTAAGCAGC TC3CTQGGACT 120

CAGGGACTOG TQCAGGTAGG CTGAGTGGCA GCTCAGTCCT AGAAGGTCTC TGAAGATCTG 180 

GACTGAGGAC CYTGCTACTC CCCAAGCCAG AGCCCATCAG CCAGGCCPQC TGTGAGCCAC 240 

CTGCCTGTGG AGTGCTGAGC TCAACCAAAG GCTOQCAAGC TCTGGGCCTC ATTTAAGOGA 300 

TTCTGATGAG CCX5ATGGGCC CTGGAGGCAG CCCATTAAAG CATCTGGCTC GTTTITOGAA 360 

AAAAAAAAAA AAAAAG 376

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1192 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

CCCAGTATAT TTCTATAACA TTTATTTTAG TGAACTTATA ATGTTTCrTT GTATTAAATT 60

ATTAGATTAT ATCTTTAGAT AATATTGTTA CTNAATTAGT AGGTAATATA TATTTTATTC 120 

AAAAATAAAT TGTGCATCTA ATGTCTACCA ATTAATGTAC TTOTAGATGT ATCTTATCTT 180 

AACTTGAGTC TTTGCTQCCC CTAATGAGGT GrTGAAGGACT CTTCTCCCCT QGGGAAGTTT 240 

TTCTTTITCA GGAGGGAGGA GGGCTTTCCC AGGTAATGTG TCTAGAGTGT TGGGCAGAAR 300 

AATCTGGGAC CACACCACAC CAGTTCTCTC CTTAATCCAC GTCATTTQCC TTCTATCCCA 360

GCTATGrrrc CAGTGTCCTC TGGGTGTTTC CAAGAGCAAC AAGAAAYGAA TAAATCTCTG 420 

KTGAGTTGTT TATTTGTTCT TCACnTOTT TTACACTGTA WTTTCTGAGT TTATGGGTGT 480 

CTGTGAATTA AAAAGGAAAA GTRGAAATAA GTAAAACTCA GGTTGAAGGA AATATACATA 540

AATAAGATAA AGCTGAOCTG TAGATATARR CAGGTTATAA RAGCTTAGAG TTGTCTAAGrr 600 

TS^GTGCAAA K i TiVL TCTC ATCTTTCTGA TGCCGARACA AAAAAGGCAG TCATGTTICT 660

WATGTGMTG GAATOGAACC CGARAAGAGA GCAYGCTGTG TTCTTGGQGA CAGGAAflGCT
TGYGTQCACC AAGTCTKAAC CACCACCTTC ATGC3GACATA GRTTATGrTGC TQC3AACA2aT 

TTCACAOCGG CCTGGCAGTA AACACTTC3TA GTOTTOTGCA C3TGGAAACGG TCATCTTCOG 
CTAAAGCACG GOGTOTTCTG CAGCQGAAAT QGrrGATCTGC TGCTAAAACA CAGCTTCCAT 
CCTAATGTAT GCTCCTTACT CAAAGAGrTGT GGTCCCAAAC AQCCTTTGGG AGGTCCTCCT 
TGATTCATQG ATGAAACCTG GAACATCTTG AGGACTGAGT TAACCATEAGG TCCTTAAATA 
ACTCTCCACA CUn ' A ' i ' AUtJ ' ri ' AGrTTTATCTC TACATGCAGG GTGTGCAGCA GCCTGTrCAA 
AGTCATATTT TCTGQGAAAT ATTTCCAGTG TTTATTTGCA CTTTAGCXXA CTCTGnCTAG 
CCTTATTTCT TCTAAACTCA CCATTAATCT GAATAAT^VGT CAAATTTAGG GG 

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:
ATCTTQGGCT ATCTTTGACA GGGGATTCTT GCAAGTTGAT GCTTTCTACA AGTGAATATA 
GTCAGTCCCC AAAGATGGAG AGCTTGAGTT CTCACAGAAT TGATGAAGAT GGAGAAAACA 
CACAGATTGA GGATACGGAA CCCATGTCTC CAGTTCTCAA TTCTAAATTT GnTCCTGCTG 
AAAATGATAG TATCCTGATG AATCCAGCAC AGGATGGTGA AGTACAACTG AGTCAGAATG 
ATGACAAAAC AAAGGGAGAT GATACAGACA OCMQGGATGA CATPAGTATT TTAGCCACTG 
GTTGCAAGGG CAGAGAAGAA AOGGTAGCAG AAGATGrTTTG TATTGATCTC ACTTGTGATT 
CGGGGACTCA GGCAGTTCCG TCACCAGCTA CTCGATCTGA GGCACmCT AGTGTGrTTAG 
ATCAQGAGGA AGCTATGGAA ATTAAAGAAC PCCMCCCAGA QGAGGGGTCT TCAGGGTCTG 
AGGTGGAAGA AATCCCTGAG ACACCTTGTG AAAGTCAAGG AGAGGAACTC AAAGAAGAAA 
ATATQGAGAG TGTTCCGTTG CACCTTTCTC TGACTGAAAC TCAGTCCCA 



(2) INFORMATION FOR SEQ ID NO: 203:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 847 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:
QCXAOGAGCG CAAGCTGCTG GCCGCCATCA ACGCX5TT0CG CCAGGTGCGG CTGAAACACC 
GGAAGCTCCG GGAACAAGTG AACTCCATGG TGGACATCTC CAAGATGCAC ATGATCCTGT 
ATCACCTGCA GCAGAATCTC AGCAGCTCAC ACCGGGCCCT GGAGAAACAG ATTGACfiCGC 
TQGCGGGGAA GCTGGATGCC CTGACTQAGC TGCTTAGCAC TGCCCTGGQG CXX3AGCAGCT 
TCCAGAACCC AGCCAGCAGT CCAAGTAGCT GGftCCCACGA GGAGGAACCA QGCTACTTTC 
CCCAGTACTG AGTOGTQGAC ATCGTCTCTG (XACTCCTGA CCAGCCTGAA CAAAGCACCT 
CAAGTGCAAG GACCAAAGGG GGCCTGQCTT QGATQGCnTG GCTIGCTGAT GGCTC3CTGGA 
GGGGACGCTG GGTAAAjGrTGG GGAGQCCTTG GCCCACCTGA GGCOCCAGGT QOGAACATQG 
TCACCCCCAC TCTGCATACC CTCATCAAAA ACftCTCTCAC TATGCTGCTA TOGACGACCT 
CCAGCTCTCA GTTACAAGTC CAGQOGACTG GAGGCAGGAC TCTTOQGrPCC CTGGGAAfiGA 
GGGTACTAGG GGCCCGGATC CAGGATTCTG GGAGGCTTCA GTTACCGCTG GCCGAGCTC3A 
AGAACTGGGT ATGAGQCTQG GQCGQGGCTG GAGGTGQOGC CCCCTOGrGG GACAACAAAG 
AGGa«:ACCAT TTTTCCAGAG CTGCAGAGAG CACCTGGTQG GGAGGAAGAA GTGTAACTCA 
CCAGCCTCTG CTCTTATCTT TGTAATAAAT GTTAAAQCCA GAAAAAAAAA AAAAAAAAAA 
AAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 
ACAAACATAC TCGCAGGAAG GAGTCTCATG CTGCCCGCAG CATCAGCGCA ACNCNTC3GCC 
GCCATCAACG CGTTCCGCCA GGTGCGGCTG AAACACCGGA AGCTCCGGGA ACAAGTGAAC 
TCCATGGTGG ACATCTOCAA GATGCACATG ATCCTGTATG ACCTC3CAGCA GAATCTGAGC 
AGCTCACACC GGGCCCTGGA GAAACAGAIT GACACGCTGG OQGGGAAGCT GGATQCCCTG 
ACTGAGCTGC TTAGCACTGC CCTGGGGCCG AGGCAGCTTC CAGAACCCAG CCAGCAGTCC 
AAGTAGCTGG ACXXACGNAG GAGGAACCAG GCTACTTTCC CCAGTACTGA GGTQGTQGAC
ATHOGTCTCT TGCCACTCCN TCSIACCCAGC CCPGEAACAAA GCACCTCAAG TGCAAGGACC 420
AAAGGGGGCC CTGGCTTGGA GrGGGTTQGC TTGCTGATOG CTGCTOGAGG GGACGCTGGC 480
TAAAGTOGGK AGQCCTTGGC CCACCTGAGG CCCCftQGrTGG GAACATGGTC ACCCCCACTC 540
TGCAIAOCCT CATCAAAAAC ACTCTCACTA TGCTOCTATG GAOGACCPOC AGCTCTCAGT 600 
TACAAGTC3CA GQCGACTGGA GQCAGGACTC CTGGGTCCCT GGGAAAGAQG GTACTAGOQG 660 
CCCX3GATCCA GGEATTCTOGG AGGCTTCAGT TACOCXTOGC OGAQCTGAAG AACTOGGTAT 720 

GAGGCTGQQG CGQGGCYQGA GGTOGCXSCCC CCTGGTOGGA CAACAAAGAG GACACCATIT 780 

TTCCAGAGCT GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCCTCTC5CT 840

CTTATCTrPG TA 852 



(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1354 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

GATTCGGCAC GAGQCTTGCT GGAGCAGGAG AAGTCTCTRG CCGGCTGGGC ACTGCTTGCTG 60 

GCASGARCTG GCATTGGACT CATGGTGCTG CATGCAGAGA TGCTGTGGTT CGGGGGGTGC 120 

TCGGCTGTCA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 180 

CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT CGTYTGCCTG 240 

TCXACTGGAG TCATGGGTGT CTQCTGCACA GCCCTGCTGG TQGCCGTOGT GGCCCGGAAG 300

CTQGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTAOftCC 360 

AAAGAGATGA AG6AGTCCGC TGCCCGAGTO CTACAAGAAG CCTGGATGTT CTACAAACAT 420 

ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAANCTGCT GGCOGCCATC 480 

AACGOGrrCC GOCAGGTGCG GCTGAAACAC CXSGAAGCTCC GGGAACAAGT GAACTCCATG 540 

GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 600

CACCOGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATQC CCTGACTGAG 660 

CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 720 

CTQGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 780 

CTGCCACTCC TGANCCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC AAAGGGGGCC 840 

CTQGCTTQGA GTGQGTTGGC TTGCTGATOG CTGCTGGAGG GGACGCTGGC TAAAGTGGCSK 900

agc3ccttqc5c cxaoctgagg coccaogrtgg gaacfttggtc acccxxactc tgcataccct
catcaaaaac actctcacta tqctc3ctato gacgaoctcc agctctcacn* tacaaijigca 
ggogactgga qgcaggactc vtoqgtccct gggaaagaqg gyactagqgg cxxx3gatcca 
ggattctgqg agqcttcagt taccxxttogc ogagctgaag aactgggtat gaggctggc3g 
oggggctgga ggtqcxx3ccc cctogtggga caacaaagag gacaccattt ttcxagagct 
gcagagagca cctqgtoggg aggaagaagt ctaactcacc agcctctoct crtatctttg 
taataaatgt taaagccaga aaaaaataaa aaaaaaaaaa aaaaaactcg agggggc3c0c 
agacxx:aatc tccx:tatagt aagncgccnn anan 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1378 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:
TCCCCAGGTG CACAGCXAGG GCXX:TCCTGT CTGCAGGAGA ATTCACAGCT GGTGTGGGAC 
TCAGCXXXTTA GNCCATTCAA AGCCTTAATG TTGTAATCAT ATCTTACGTG TTGAAGACCT 

gactqgagaa acaaaatgtg caataacgyg aattttatct tagagatctg tgcagcctat 

TTCTGTCACA AAAGTTATAT TGTCTAATAA GAGAAGTCTT AATGGCCTCT GTGAATAATG 
TAACTCCAGT TACACGGTGA CTTTTAATAG CATACAGTGA TTTGATGAAA QGACGTCAAA 
CAATGTQGOG ATGTa?rGGA AAGTTATCTT TCCCGCTCTT TGCTGTQGTC ATTGTGTCTT 

gcagaaagga tggccctgat gcagcagcag ogcx:agctgt ANATAAAAAA taattcacac 
TATCAGACTA GCAAGGCACT AGAACTQGAA aagaccacag aaaacaaaga atccaacxxt 
TTCATCTTAC AGGTGAACAA ACTGrrCATGA TGCACATGTA TGTGrmTGrr AAGCTC3TGAG 
CACCCTTAACA AAATGTAAAT TTGCCATTAT TAGGAAGTGC TGGTGGCAGT GAAGAAGCAC 
CCAGGCCACT TGACTCCX:aG TCTGCrrGCCC tgtctacacc agacaacaca ggagctqggt 
CAGATTCCCC TCAOCTQCTT AACAAAGTTC CTCGAACAGA AAGTGCnTAC AAAGCTGCCT 
TCTOGGATAC TGAAAGGTCG AOTITTCTGA ACTGCACTCA TTTTAITCCA GriTGAAAAAA 
AAAAAAAGCT AITCXZAAAGA TTTCAAGCTG TTCTGAGACA TCTTCTGATG GCTTTACTTC 
CTGAGAQGCA ATGTTTTTAC TTTATGCATA ATTCATTGTT GCCAAGGAAT AAAGTGAAGA
AACftOCACCT TTTAATOTAT AGGTCTCTCT GGAAGAGACC TAAATTAGAA AGAGAAAACT 960

GTGACAATTT TCATATTCTC ATPCTTAAAA AACACTAATC TTAACTAACA AAAGTTCTTT 1020 

TGAGAATAAG TTACACACAA TGGCX^CAGC AGTITGrrCTT TAATAGTATA GTOCCTATAC 1080

TCATCTAATC GGTTACTCAC TACTQCCTTT AAAAAAAAAA ACCAGCATAT TTATTGAAAA 1140 

CATGAGACAG GATTATAGTG COTTAACOGA TATATTTTGT GAdTAAAAA ATACATTTAA 1200 

AACTGCTCTT GTGCTCTAGT ACCATGC7ITA CTTGCAAATGA TTATTTCTAT GTACAACTGA 1260 

TGCTTGrrrCT TATTTTAATA AATITATCAG AGTGAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGAA NAAAANAA 1378

(2) INFORMATION FOR SEQ ID NO: 207:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1166 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

AANCCACTGC ANTTTAAACC CCCTCCCCTC CAAGAAAGTT CACAACCGGC CATGGATGAC 60

CCTCATTTTA GATQGGCCNC AATATTEAAG ATGGACTC5RG C3MCCCCARAG ACTGACCCTT 120 

GAAAGQGGGA CTCAGAAGAA AGATCXTTTGA CATTGCCMAA CATGCTGGGC TTGTCCAACA 180 


CAGTGATGCG GCTCATCGAG AARCGGGCTT TCCMAGGACA AGTACTTTAT GATAGGTQGG 240 

ATGCTGCTGA CCTGTGTGGT CATGrTTCCTC GrTGGTGCAGT ACCTGACATG AGCCAGCCAC 300 

GCTCAGTOGC TGAACAGCAT TCCCACAGCC TQCAAGTGTG TGTC?PgrGTG AAAGAGAGAG 360

GQGGCCCAGA GGCOOCCTTT TGAAATGTTT GCCTGnCTGA ACTGTGAAGA CACTTGGGAG 420 

TGATTGTGGT CTAATTTCCA ACCTOCTCTG TITTCTGrTCA CATCTTQGAG QQGGAGCTAG 480 


TQCCAMCACC ATGCQOGGTG CTTAGGAAAT GAAAGAAGTC CCGGGTCTGT CTCTCTCACT 540 

CTCGCrCTCA mOGQGGAGG GAAAGAATGG CTrTQGTGGC TTTGTTCACA CAGCTGATGC 600 

GTGSCCTQGG AAGGTGTCCA CAGTGAGCCC TGTGTGCAGG ACTGTCCACN ACGGTITCACA 660

CCTTGTCACC ATCAGGCCTT TCTGGCTCCT GATAGGGTrGG AGCAAAAGTG GAAAGGAAAG 720 

GAAAGAGGCY TnTCTCACA GCCATTATAT TAAATAGTAG GTCGATTCAC ATCYTCGrTGC 780 

TCCTOGCCAC CCTCCCCTGT GCCTCAGTGA CATGTAGATG ACTGACTGCC AATACTTGTC 840 

ACCATTCCCT GGAAGCAGCT ACCTAGGGGA AACAAGATGT AGrTGCTATTG CCGATAACAA 900 

OTAAGATTTT CCACACTACA GCTGQGT(3TT TCTCTTTTCT AAAGTGAGGC CAGTCTTATT 960
TCCCGQGAGT GTrCAGTCTT GACOCTAGTC ACTGATTTTT TCTAGnTCTT AATAGAGTGG 1020

TTQGGCTTTT AAGGTTCAGA GACTGTOQGC TTGGGCACCT GCXXXXWQGG STi TX lT i aA; 1080 

GGCCTTTQCC CCTTAGRAAA GTAGCTTTTA QGGGCAAAGA TTTGnTGATT TTCCCCA3TA 1140 

CAGTCTTCAG CTCNAGGGTT TTAAAA 1166 



(2) INFORMATION FOR SEQ ID NO: 208:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 697 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:
TACmCTAGG ATTATAAGGA ATTAACATTG AGATGACATT TCCATTTGAG AAQGAAAATA 60 
GTTGCTTTCA CTGCCTTTTA TTTGATTCCT GGAGAGAGCA GACTCGCACS AACATTCAAC 120
CCCAGCGCTG ATATGACAGT AATCCTCAGA GGCAGAGCCC AGCACAAAAC AGCAATGCTA 180 
GAAAGTTACA ATTGGAAAGT TTCCTGCCAG CTTCGGGAAT GACACTGCAA AGCTCATCSCC 240 


AGAAACTGCC AGKGTAATTC TCCTCATTAC TGCTCTACCC ACCCACTTTC AGCTCCCCAA 300 
ATTAACTAGT GCAGTTGACT AATTCTCTTT ACCTTTATCA TTTARGGTCA RGCATTGCAC 360 
AAAAACTCTC GACTTTGCCA TATAAGGGCT GTGGTTCTCT GTGGTCCCCT GGATAAGAGG 420
CATCACCATT ATCTQGAAAC ATGCAGTAAA TGCAGATTOT TCATCTTCTC CCCAGACCTC 480 
CTGAGTTAGA AATTCACAAG TTCTCCAGGT GATCTCATAC ATGCTAAACT TTGAGAACCA 540 


TTGAGTAAAG TTAATGCATT AAGAAGAGAT TAGATAGC3GA TGGTQGCX7EA TCTTCCTACA 600 
GTTTCCCTGT TAACAAGAAA GTCAGAGGTC AGrTTGATCAG ACATTAGATT ATTTATTGCT 660 
AAAACTAAAA AAAATTAAAA AAAACTGGAG GGGOGCC 697



(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 932 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:
CGTGAGTCAC CTCTCTATAG TGGGCGTGGC CGAGGCOGGG GTGACCCTGC OGAAGCCTCC 60
GCTGCX:AGAA ACCATOTTCA AGGTAATEAA AAGGrrCCGTG GGGOCAGCCA GCXTOAGCTT 120

GCPCACCTTC AAACJTCTATG CAGCWXAAA AAAGGACTCA CTTOOCAAAA ATTCCGTGAA 180 


GGTTCATGAG CTTTCACTCT ACTCAGTTCC TGAGGGTCAA TCGAAGTATG TQGAGGAGQC 240 

AAQGAGOCAG CTTGAAGAAA GCATCTCACA GCTOCGACAC TATTGCGAGC CATACACAAC 300 

CTQGTCTCAG GAAAOTCACT COCAAACTAA GCCCAAGATG CAAAGTTTC3G TTCAATGGGG 360

OTTAGACAGC TATGACTATC TCCAAAATGC ACCTCCTQGA TTTTTTCCGA GACTTQGrPGT 420 

TATTQGTTTT GCTGGCXnTA TTGGACTCCT TTTGQCTAGA GGTTCAAAAA TAAAGAAGCT 480 


AGTGTATCCG CCTOGrTTCA TGQGATTAGC TQCCTCOCTC TATTATCCAC AACAAGCCAT 540 

CXjrcrrTTGCC CAGGTCAGTO GGGAGAGATT ATATGACTGG GGITTAOGAG GATATATAGT 600 

CATAGAAGAT TTGTGGAAGG AGAACTTTCA AAAGCCAGQA AATGTGAAGA ATTCACCTGG 660

AACTAAGTAG AAAACTYCAT GYTCTGCCAT CTTAATCAGT TATRGGTAAA CAITGGAAAC 720 

TCCATAGAAT AAATCAGTAT TTCEACAGAA AAATGGCATA GAAGTCAGTA TTGAATGTAT 780 


TAAATTGGCT TTCTTCTTCA GGAAAAACTA GACXIAGACCT CrGrrTATCTT CTGTGAAATC 840 

ATCCTACAAG CAAACTAACC TGGAATCCCT TCACCTAGAG ATAATGTACA AGCCTTAGAA 900 

CTCCTCATTC TCATGTTGCT ATITATGTAC CT 932

(2) INFORMATION FOR SEQ ID NO: 210:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

GTCATTCTTT AAATAAAAQC nTCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA 60

AGATTCOCCC TAGGGTTGAT ATGnCTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA 120 

TTTTAAAGCT TTGQCTATAT AGTCAGAAAT GTCCTAAATA ACAAACTATT TTGTATTTAA 180 


TTTAQQGAAG ACTAAAGGGA AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT 240 

ATTAGGGCTT AAAGGGCrTTT TCTAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT 300 

TCCICTTTTT GAGATGAACA GATCTCTGGG GAAATTGTTG AGTTACAATG GCATTTCACT 360

GTGATCCCTC TCAAGCTCAG ATCAGTTCTA TAACCCAATG ACAACCTGTC TCTTTGCTTTT 420 

ACTGTCCTGT GAAATGTCAG CTCAAGTTTC CCAGAAGTCG TGTGTTTATG ATGAGTCAGA 480 

GTOcrrrrcc tcggtgggac agtixxtogc cctcttaatt ttogtotatc tgcttccaag 540

TATCTAAACC TCCAGTCTGA TCTGTATATG CTATCCTAAC TGTTAAITGT ATTATTGATT 600

ATOTTGATTA TCTTOCTTCA AOCTPCATAC TTTTCAATTT GATAGAAATA AAGTrTTTTT 660

C 661



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 592 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

GAAACTGACA TTGTTAAACA CACTAAAACA GAAGTACTTA OCTCTTGAAG ATTTAATATA 60 

TAATGGTTGA CATGATACAT GTACATGAAT GGAATGACCA GATGCTTATG GTCTACATTT 120 

TCCTTTATCC TGTTACTATr AOCTTCCTTA ATCnTCTTC CTTAACATGC TAAATTCCTC 180 

TTCAGTGTTT ATTTTCTAGT GACAGAATGC TAACATTTCT TACACCCTGG CAGAAGQGAG 240 

AGAAATGTGT TTTQGGGTQG GTAACTAAAT TTTTGAGrGA AATATCATAA GATGAGAATG 300 

GAAAGAGCX3A GACACAAAGA GTTATAACAA AAAAACAATG GTTrTTTTAG CCATTTGACT 360 

GGCTCTTTAA ATAGTCTACA AGACATTCAC GTTOAACATC ACTTTTAGTG AAATAAAATG 420 

TGCCATACTA GTATGTGCTT CAAAAGGGCA AATGTGCTTT AGTGCCCTAA GGCTAAATTT 480 

TGGTCATTTG ACATCAGAGA TGTTGTAAGT ATTGCACTTA ATACGCACCT ATTTCTCAAT 540 

AGTGOTATTT TTTTC3GCTAG CATTTNCTTT ACCACTAACC TTGTTQGATA GC 592 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:
TGGAGTQQCT TPCCAGCTGA ATGAATCCTA TCTTCTCGCGT GCAGGTGGTT QGTTTTCAAT 60 
GrrCTTSCTA ATTTTTTTCC TATTGGCTCT TGGGAGTTTN CriTGnTTGC TCCTGTGTTT 120 



GCCCAGCTTT AATAAAACCA GGCGCAAACA AAAACCATAG CATTCTGAAA CAATAGGGGG 180
CCCACATK5G ACCCftGTATG TCACTTTAAT QGACTTCAAG AAAAAATCTG AATQGGAAAA 240
TGACACTAGG AATCTAJTACT CCACACATTT TATOCCATAT AATGGTGnCT TITCTTAArr 300 
TTOTITCTTG TQQOQAAATG TQQCTTTCAA ATTAAAATQl CXnTTTCTTC TnCGAAACTT 360
TTrormCGA CTKCTATAAT TAAGGGTTTG GAAAGATTCA TAAnKTGAG AGAGGTTTQC 420 
AACCflGGAGA TACAAAGAAG TCTCAGTAGT AATCTTGrrTC ATGrTGCTTTT ACAGCCAGCT 480 
ACATTTAAGR ATGTATTAGT TACAGAAATT ATATCTPCTGT CTATgrGTCT CTACTCAATA 540 
AACTACATQC CPCCACATAA TQCX3GTGCTG TCCATCTOGG CAAATACTQG CCAAC3TCCCT 600 
TTATGACAGG CACACAGAAA CCATAGCATG GTCTGGCTrTT CAGAAAATGC CTCTCATCTrr 660

TCCTOC5AACC TTATTTTGCT AAATGTCTGrT TTTCTTOTGA TTICTTGTAC CTCACACCAC 720 

CATTGTGACC ATGGTGATGC GTCATTTGCA TGATATGTAC CT TOAUm 'A ATGTCAAATA 780 

CATTTTCATT GAAGAGTCTG ATGACTTGCT AGCGTTTTAT TTTTTCTGTA AGCTCAATGr 840 

GCTGAAACCA AACCAGGCTT TTAAAAACCT GrTGTAGAAGA AAACCAAAAA ATCCTGTCTO 900 

GGrrGTCCTTT CCCTGTCAAA CTCATTAAAA ATTCCTTT 938

(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1079 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

AGCCTGCCGG GAGAGrPGGTG GCATCTRARA GGCTGGTCGrr GGACTGTGGT TOGGQGAGGT 60

GGGAGCTOTT TTAACCGrTGT GCCCXXTCTC CTGTGCCKGC GTTGGGCATCC CCCGGGGCAG 120 

TGGAW2QCGG GCGCTCCTCC AGCTTCOGAG TCCAGCCAGC CTGGGOGOGG GGCGCGCCCC 180 


CGAGACACCC GAGGAGTCCG TTCCTCCCTG GTTACGTGGA CTGTGGAGCT GGTCTCTTGT 240 

C3GCTCAQCGC CGTQCGGAGG TTGAAGCGTA CXTTGCGGAGG TCGCACCAGG GGCGTGAGGA 300 

GGAGGAGGAA GGGCATGAGC CGAGCTTGAG GAATCCGTGY TCCAAACTCT ACACTCAAGG 360

RTGCMCTGCG CAACTCTG(?r GGCGATGGGC TGGQGCA6AT GrrCdTGGAG TTCTACCAGA 420 

AGAAGAAGTC TCGCTQGCCA TTCTCAGACG AGTGCATCCC ATGQGAAGTC TOGACGGTrCA 480 

AGGOXSCAror GGTAGCCCTG GCCACGGAGC AQGAfiCGGCA GATCTGCTGG GAGAAGGrTGG 540 

GTPGAGAAACT CTGCX3AGAAG ATCATCAACA TCGTGGAQGT GATGAATOGG CATGAGTACT 600 

TGCCCAAGAT GOXACACAG TCGGAGGfTGG ATAACGTGTr TGACACAGGC 1TCCGGG^W:G 660 
OXXIAGCCCTA CCTGTACAAG ATCTCCTTCX: AGATCACTGA TGCCCTOQGC ACCTCAGTrCA 720

CCACCACCAT GOGCAGGCTC ATCAAAQACA CCCITOCCCT CTGAGCCTOG CTOGASCTCT 780 

GQGAGCTCCT TGATGGCPOC CAGACCTTGG CTTTTGOGAA TTQCACnTT QGGCCTTTGG 840 

GCTCTGGAAC CTGCTCTOGG TCATTOGrTGA GACTTOGAAG QGGCAGCCCC CXSCTCGCTIC 900 

TTCGrmTGT GGrrrOCCAGC CTCAGGTCAT CXnTTTAArc TTTGCTGACX3 GnTCAGlCCT 960 

GCCTCTACTG TCTCTCCATA GCCCTC3GTOG GCTTCCCXXnT CITPCTOCAC TGTACAGAAG 1020 

AC3CCACCACT GQGATGGGGA ATAAAGTITGA GAACATC3AGT TTOGQCTGAA AAAAAAAAA 1079 



(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3791 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TGAAGCAGGC GCTCTTC3GCT CGGCGCGGCC CGCTGCAATC CGTGGAGGAA CGCGCCGCCG 60 

AGCCACCATC ATGCCTQGGC ACTTACAGGA AGGCTTCGGC TGCGTGGTCA CCAACCGATT 120 

CGACCAGTTA TTTGACGACG AATOGGACCC CTTCGftGGTG CTGAAGGCAG CAGAGAACAA 180 

GAAAAAAGAA GCCGGCGGGG QOGGCGTTGG GGGCXXTOGG GCCAAGAGCG CATCAGQQCC 240 

GCGGCCCAGA CCAACTCCAA CGCGGCflGGC AAACAGCTGC GCAAGGAGTC CCAGAAASAC 300 

CGCAAGAACC CGCTGCCCCC CAGCGTTQGC GrGGTTCACA AGAAAGAGGA GACGCAGCCG 360 

CCCGTGGCGC TTTAAGAAAG AAQGAATAAG ACGAGTTGGA AGAAGAOCTG ATCAACAACT 420 

TCAGGGTGAA GGGAAAATAA TTGATAGAAG ACCAGAAAGG CGACCAOCTC GTGAACGAAG 480 

A1TCGAAAAG CCACTTGAAG AAAAGGGTGA AGQAGGOGAA TTTTCAGTTG ATAGACCGAT 540 

TATTGACCGA CCTATTCGAG GrrCGTQGTGG TCTTGGAAGA GGTCGAGGGG GCCGTGGAOG 600 

TGGAATGGGC CGAGGAGATG GATTTGATTC TCGTGGCAAA CGTGAATTTG ATAGGCATAG 660 

TGGAAGTGAT AGATCTTCTT nTCACATTA CAGTGGCCTG AAGCACGAGG ACAAACGTGG 720 

AGGTAGOGGA TCTCACAACT GGGGAACTGT CAAAGAOGAA TTAACTGACT TGGATCAATC 780 

AAATGTGACT GAGGAAACAC CTGAAGGTGA AGAACATCAT CCAGTGGCAG ACACTGAAAA 840 

TAAGGAGAAT GAAGTTGAAG AGGTAAAAGA GGAGGGTCCA AAAGAGATGA CTTTGGATGA 900 

GTGGAAGGCT ATTCAAAATA AGGACCGGGC AAAAGTAGAA TTTAATATCC GAAAACCAAA 960 
TGAAGGTGCT GATQGGCAGT GGAAGAAGOG ATTTCTTCTT CATAAATCAA AGAGTGAAGA 1020
GGCTCATGCT GAAGATTCGG TTATQGACCA TCATTTOXSG AAQCCAGCAA ATGATATAAC 1080 
GTCTCAGCTG GAGATCAATT TTGGAGACCT TOGCXXXXTA GGAOSTOGCG GCAGC3GGAGG 1140 
ACGAGGrrOGA CCTOGQCXTTG GTOGGOGCTC AAACCGTGGC AGCAGGACOG ACAAGTCAfiG 1200 

TQCTTCTGCT CXTTGATGTGG ATGACCCAGA GGCATTCCCA GCrPCTOQCTT AACTGGATQC 1260 

CATAAGACAA CCCTOGl'lCC TTTGTGAACC CTTCTOITCA AAGCnTTQC ATGCTTAAGG 1320 

ATTCCAAAOG ACTAAGAAAT TAAAAAAAAA AAGACTGTCA TTCATACCAT TCACACCTAA 1380 

AGACTGAATT TTATCTGTTT TAAAAATGAA CTTCTCCCGC TACACAGAAG TAACAAATAT 1440 

GGTAGTCAGT TTTGTATrrA GAAAaCTATT GGTAGCflGQG AICTTTTCAT AATTTTCAGA 1500 

GATTA3t3CAT TCTTCAT6AA TACTTTTGTA TTGCTGCTTG CAAATATGCA TTTCCAAACT 1560 

TGAAATATAG GnGTGAACAG TGTGTACCAG TTTAAAGCTT TCACTTCATT T GTOrm ' iT 1620 

AATTAAGGAT TTAGAAGTTC CCCCAATTAC AAACTCGITT TAAATATTGG ACATACTGGT 1680 

TTTAATACCT GCTTTGCATA TTCACACATG GTCAACTOGG ACA3CTTAAA CTTTGATITC 1740 

TCAAAITTTA TQCTOrGTQG AATACTAACT ATATGTATTT TAACTTAGTT TTAATATrrr 1800 

CAirrrTGGG gaaaaatctt TrrrcAcrrc tcatgatagc tgteatatat atatgctaaa 1860

TCTTTATATA CAGAAATATC AGTACTTGAA CAAATTCAAA QCACATTTQG TTTATTAACC 1920 

CTTGCTCCTT GCATGGCTCA TTAGGTTCAA ATTATAACTG ATTTACATTT TCAGCTATAT 1980 

TTACTTTTTA AA2T3CTTGAG TTTCCCATTT TAAAATCTAA ACTAGACATC TTAAHTOGTC 2040 

AAACTTGTTT AAACTACTTA TTGTTQGrPAG GCACATOGTG TCAAGTGAAG TAGmTATA 2100 

GGTATGGGTT TTTTCTOCCC CTTCACCAGG GrTGGGTOGAA TAAGTTGATT TGGCCAATGrr 2160 

GnAAXATTTA AACTGTTCTG TAAAATAAGT GTCTGGCCAT TTGGTATGAT TTCTGTGrTGT 2220 

GAAAGGTCCC AAAATCAAAA TGGTACATCC ATAATCAGCC ACCATTTAAC CCTTCCTTGT 2280 

TCTAAAACAA AAACCAAAGG GCXJCTGGriTC GTAGGGrTGAG GTGGGGGAGT ATTTTAATTT 2340 

TTGGAATITG GGAAGCAGAC AGCTTTACTT TGTAAGGTTG GAACAGCAGC ACTATACATG 2400 

AAATATAAAC CAAAAACXTTT TACTGTTTCT AAATTTCCTA GATTGCTATT ATTTGGTTGrT 2460 
AAGTTGAGTA TTCCACAGAA AGrTGGTAATT ATCTCTTCTC TCTTCCTCCA TTAGAAAATT 2520

AGGTAAATAA TQGATTCCTA TAATGGGAGC ATCACCACTT ATTAAAACAC ACATAGAATG 2580 

ATGAATTAAA AAAGTTTTCT AGGATTGTCT TTTATTCTGC CACATTTATT GATAAACACTT 2640 

GAAGGAATTT TTAAAAAATT TTTAAGAATT GTTTGTCAOG TCATTTTTAG AAATGrPPCTA 2700 

CCTGrrATATG GTAATGTOCA CTTnTAAAAA TATTQGACAT CTTCAATCTT AAACATTTCT 2760 
ATTTAGCTGA TTQGTTCTCA CATATACrTC TAAAAGAAAC TrTTATGTTA TAAGAGTEAC 2820

TTTTTQGATA AGATrTAITA ATCTCAGTTA OCTACTAnC TGACATTTTA GGAAGGAGGTr 2880 

AAnCTTTTT AATGATQGAT AAACTTGrTGC TGGTQTTTTG GATCTTATGA TOCTGAGCAT 2940 

GTTCTOCACT GGTQCTAATG TCTAATATAA TTTTATATTT ACACACATAC GTGCTACCCA 3000 

GAGATTAATT TAGTCCATAT GAACTATTGA CXXATTGTTC ATTGAGACAG CAACATACGC 3060 

ACTCCTAAAT CAGTOTCTTT AGACTTTTCA AGTATCTAAC TCATTTCCAA ACAICTAOCA 3120 

TGrrTTTATAA AOCTCTTGAT TPCCAGCAAC ATACTATAGA AAACftCCTGC TACTCAAAAC 3180 

AC7VACTTCTC AGTOICATCC ATTQCTGrrCG TGAGAGACAA CATAGCAATA TCTOCTATCT 3240 

TQCAflGCTTT CAAGATAGCC TGAACTTAAA AAGTTGGrTGC ATTAGrTTGTA TCTGATQGAT 3300 

ATAAATTTGC CTCCTAGTTC ACTTTCTTGrC AAGAGCTAAA ACTGTGAACC TAACTTTCTC 3360 

TTATTGGTGG GTAATAACTG AAAATAAAGA TTTAnTTCA TC5CTCACTTC TTAAAAGTCA 3420 

TAAAAACAAT CAAATAGOrT CA3CTTTATT GrPCATGTCTT IXLtrHjU K Tit: TGACCTGrTGT 3480 

GCACACOCCT GTOTGnTTAT AATTTTTAAA TTGAATTTTA TA3X3GGGTTT TTATTTGCTA 3540 

AAAACCAGGC TGTTQAATCA CATTTGGGAA GGGTACTTAT CTTAATGACT AATQACTTAA 3600 

TTQGGAAAGT TGAATTCTTG TAAAATACAA AATCCAAGGA CTTCTTGC3GA TTTAATCTAA 3660 

ITCTCACTTC IWTAGGCAGA TNCACTITTT TOGATAATGG AAAGTTAAGC ATACCGAATG 3720 

CEftCTTTTGG TTGACAAACG GGCCTAATAG TCCGQQQGGA AATCCCTAAC NGGTAAGGNT 3780 

CXXIAAGTATG G 3791 



(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1334 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

CAGTGCTCGC TCCTGCTCGG GGCGCTGCGG CCCCGGGCGT CQCCATGACC AGTGAGCTGG 60

ACATCTTCGT GGGGAACAGA CCCTTATCGA CGAQGACGTG TATCGCCTCT GGCTCGATGG 120 

TTACTCGGTG ACCGACGCGG TGOCCCTGCG GGTOCGCTCG GGAATCCTQG AGCAGACTGG 180 


CGCCACGGCA GCGGTGCTNC AGAGCGACAC CATQGAOCAT TACCGCACCTr TCCACATQCT 240 

CGAGOGGCTG CTGCATQCGC CGCCCAAGCT ACTGCACCAG YTCATCTTCC AGATTCCGCC 300 

CTCCOGGCAG GCACTACTCA TCXSAGAGGTA CTATGCCTTT RATGAGGCCT TTGTTCQGGA 360
GGTOCTGGGC AAGAAGCTGT CXIAAAGGCAC CAAGAAAGAC CTOGATGACA TCAGCACCAA 420

AACAGGCATC ACCCTCAAGA GCTQCOQQAG ACAGmTGAC AACTTTAAAC GGGTCTTCAA 480 

GGTGGrTAGAG GAAAOOTSGG GCTOCCTQGT GGACAATATT CAGCAACACT TCCTCCTC TC 540 

TGACOGCTTG GCCAGGGACT ATGCAGCCAT OTICTTCTTT QCTAACAAOC QCTTTGAGAC 600 

AGGGAAGAAA AAACTOCAGT ATCTGAGCTT CXXTTGACTTT GCXHTCTGCX; CTGAGCTCAT 660 

GATCCAAAAC TGGACCCTTG GAGCCGTCGA CTCACAGAIX; GATC3ACATGG ACATGGACTT 720 

AGACAAGGAA TTTCTCCAGG ACTTGAAGGA GCTCAAGGTG CTAGTOGCTG ACAAGGAOCT 780 

TCTGGACCTC CACAAGAGCC TQGTGTGCAC TGCTCTCCX3G GGAAAGCTGG GOGTCTTCTC 840 

TGAGATGGAA GCCAACTTCA AGAACCTGTC CCGGGQGCTG GTGAAOSTGG COGCCAAGCT 900 

GACCCACAAT AAAGATGTCA GAGACCTGTT TCTGGACCTC GTGGAGAAGT TTGTOGAACC 960 

CTGCOGCTCC GACCACTGGC CACTCAGCQA CGTGCGGTTC TTCCTGAATC AGTATTCAGC 1020 

GTCTGrrcCAC TCCCrCGATC GCTTCOGACA CCAGGCCTCT QGGACCGCTA CATGGGCACC 1080 

CTCCGCGGCT GCCTCCTQCG CCTGTATCAT GACTGAGGTG CCTCCCAAOG CTCCGCCCAC 1140 

GCTGACAATA AAGTTGCTCT GAGTTTGGAG ACTGGTCCTC GCTCCGGGGA GCAAGTTGGGG 1200 

GGCGTGCAGA TGTGCCTGTG TCTGrTCTCTG AGCACCTOGT GTCCGTGTAC AAGGATGGAT 1260 

GTOiTlCNSTG QCTCCTTGGG AACTGAGACA TATCTCAGGG AATGGTGTCT GTGCTCAGCC 1320 

CATCCACCAG AAGA 1334 
(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1511 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

GTGGCGGQGA TOCTGCQAGG GGGTCTCCTG CCCCAGGCGG GCCGGCTGCC TACCCTCCAG 60 

ACTGTCOGCT ATGQCTCCAA GGCTGTTACC CGCCACCGTC GTGTGATGCA CTTTCAGCGG 120 

CAGAAGCTGA TGGCTGT6AC TGAATATATC CCCCCGAAAC CAGCCATCCA CCCATCATGC 180 

CTGCCATCTC CTCOCAGCCC CCCACAGGAG GAGATAGGCC TCATCAGGCT TCTCCGCCGG 240 

GAGATAGCAG CAGTITTCCA GGACAACCGA ATGATAGCCG TCTGCCAGAA TGTGGCTCTG 300 

AGTGCAGAGG ACAAGCTTCT TATGCGACAC CAGCTGOGGA AACACAAGAT CCTGATQAAG 360 
RTCTTCOCXIA ACCAGGTCCr GAAGCCCTPC CTQGAGGATT CCAflCTACCA AAATCTOCTC 420

COCCTTTTTC TGGQGCACAA CATGCTOCPG GrCAGTGAAG AGCCCAAQGT CAAGC3A£3ASrc 480 

CrrACGGATCT TAAGGACTGT GCCMTCCTG CCGCTGCTAG GrPGGCTOCAT TCATCACACC 540 

ATCCTCAGCA GGCAGGGCTT TATCAACTAC TCCAAGCTCC CCAGCCTOCC CCTOCTTOCAG 600 

C3QC3GflGCrrG TAGGEAGGCCT CACCTGCCTC ACAGCXTAGA CCCACTCCCT GCTCCAGCAC 660 

CAGCCCCTCC A£3CTGACCAC CCTCTTGGflC CAGTACATCA GAGAGCAACG CGAGAAGGAT 720 

TCTGTCATGT CQGCCAATGG GAAGCCAGAT CCTGACACTG TTCOGGACTC GTAGCCflGCC 780 

TGriTAQCCA GCXXTOCX3CA TAAATACACT CTOCXJITATT GGCTCTOCTC TCCTCAATOG 840 

GACATC3TGGA AfiAACTO3CX5 GTCGGGGAGT CSTOmCTCA CTTQGTTTTC ACEACTAATC 900 

ATATTGTCAG GTATAGGC5CC ACTTQGAGAT GCAGAGGATT CXATTTCAGA TCTCAGTCAC 960 

CQGCTTCGTC CTTA£?rTTTC CCAACTTGGG ACGTQATAGG AGCAAAGTCT CrCCATTCTC 1020 

CAGGTCCAAG GCAGAGATCC TGAAAAGATA GGGCTATTGT CXXXTOCCTC CTTOGTCACT 1080 

GCCTCTIGCT GCACGGGCTC CTGAGCXACC OCCTTCGGGC ACAACCTGCX: ACTOCCACfiG 1140 

TAGCTCAACC AAGCAGTICT GCTGAGAATG GCACCT0C3TG AGAQCXTTGCT GTCTCCCAGG 1200

CTTrcrrGCTG ACJTGCTGrrAC ATGTATTAGT TCCTTTACTG CTGACCACAT TGTACCCATT 1260 

TCACftGflGAA GGAGCAGAGA AATTAAGTOG CTTGCTCAAG GTCATCCAGT TAGTAAG1X3G 1320 

CAGAACftGGG ACTTGAACCA AGCCCTCTGC TCTGAAGACC GCGTCCTCAA TTTCTTCACP 1380 

AGAGCTTCCT CATCAGGTTA CCCAGAAGTG GGTCXXMCC ACCATCCAGG TCTGCTTOGA 1440 

TGTTAGTTCT CCAOCCTCGA GCTGTACGCT C3TCAAAAGTT TGGGAGCACT GCTTTATAAT 1500 

AAAATGAAAT A 1511
(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

AGGCCTTACT OTrCCTCXXZA CAAAGGAGTC QCAGCCACGC TAGCTCTGAC TTGCTACICT 60 

GACAAAGTTC ACGTAGCAGG TCTAGGCAAA GACTGGGCAA TTGAGCAGAG GAGACGGACC 120 

TGTQAGTCTG ACCRYGAGSC GCatCCCCTTC ACCTTGGCTG GGCTGGTCCT GGTCCTTAGG 180 

TTTTGTCAGG TTGrPCCTTGT TTOGATCXTT CAACTAGGTG ATAAGCACTC GAGGGGGATC 240 
AOXXXXITTG GAOGTOTTTC TTTAACCTCA TCCATATAAT AGGGOOGrTGG GATGCSTTOrA 300
GAGGTAAAGC AGGATGftTGG TGTITTAAGA OCAGAGCTTG QQACCAGC3GC TCCTACACCT 360
AATTTTCTCT CCTGGTAGCT GAACAAAGGT CTAAATTAGC TTAACAAAAG AACAGGCTGC 420
OGTCAGCXAG AGTTCTGAAG GCCATOCTTT CAGTTTCCCT TGTTGACAAT TGCTCTCCAG 480
TTCCTATGAA AGCACAGAGC CTTAQGGQGC CTQGCXIACAG AACACAACCA TCTTAGQCCT 540
GAGCTGTCAA CAGCAGGGGG TTGTGTGrrCT GTTCTGTITC TCTOCTTCCC GAACTTTCTC 600
AATAAACCCT ATTTCTTATT TTATATTTAC Gm«3GTGCTG GG 642

(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1241 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:
<3GTCCCACTG TTCCATTTTA TGCTAATAGA TTCCATTCTA GGGCCCAGCC GTCTCTTGAC 60
TGATQGTCTT CCCTTTAACC CTTGGCATGT ATAATAGAAT TTTC3GTCAAT GAAAGAACCX: 120
AAATACGCCA GATAC3TCCCC CCAGGCCCTG ATATCCATAA AAGGCTTGGG AATQCATTAT 180
GTAATTGTCC TTAGTCTTTT TGrrTGTTTTA GAAAAAAAAA ACAAGATGGG CTCAGATGGA 240
TGCCTACGTA AAAATGGTTC CTAGCTGTGT ACTCATAACT TTTCTTTGAA TTGAGTAGTG 300
AAAOGAAGGA OGAQGAAAGG AAATTAAATG TCCTTCTAGT ATTCTCTGGA CTCAAGTCTG 360
ACATATGRGA TAATAACCTA TATTGAAATG CCAAGAATTG TATCTGAAAC AAGRGAACAG 420
TOTGACACAT TTATCATGCC TPCATATTAC ATATTAACTG AAACCAATTA ATAAACATAT 480
GAAATATCCA TTGCACAAGG CAAAGGCACC TAAACCTTTT CrmX-TlTri ' CTACATAGCA 540
6AAATTGATT TTTTTTTTAT TnTTTAGGG GAACCTATAT AATTATGACC CAGTGATCTC 600
TTTTQGTGAC TTAAGCTTAT GAATTCAGGT TACAATTGAG TTGATTCTAG ATGGnTACTA 660
CCTTGAAAAG GATGTTQGTG CXTTTATGrTGA CACGAGCCAG AGCCTGCTGG GAATAAACAA 720
AGCAGATTCA TQCCAACACC AACTCGTAGC TTTAGTQGCA GATGGGAGTG GTCACAGACT 780
CXXy^VAATGT GGGGCTTTGG ATTTCCACAC CATCCCACGT GTOTOTCArc TTCCTCTTTC 840
ACACTCTTGA TGATAATTTG AAAATGBTGA AATCACCTCT GAATTTGCCT ATAGCATGAG 900
CACATTCTTA TGACAACATA ACAAATAGTP CATAATGTCA ATATTAGAAA CTGTTACAGC 960
CTGCAGTTAC CATAATTTTC CATGTTTGTG GAATTGATAT TGAAATAGCA QQGCTAAQGA 1020

ATTACTQGCA ACTTTTAGCX: TOTOGGTAftT ACCTTAQGGT TATITAAATA TTTGTAArrr 1080 

TASTTAAATG TTCATGAATO TTTGAAAGGA ACAAAATTAT CAGQGATOGC TCTTTCXX:AT 1140

GGGTCTTATT TTCACCCTCT TTTCTGTAAG AAAAAAGAAC AA3CTCTEAA TGTAmTTA 1200 

AAGTTTTTOG TATAGTTTCT AA3TCCAATT TTAATAAAAG T 1241 






(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1080 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

TGTTrATGTG ACCTAAAACA TACACACATG CACACACACA TACATATCCA TTCATTCATT 60 

CATTCAAGTG GTGTTTCCAG TGTCTGTGTG TCACTGTTTA TGCAGTTTCC ATTTCCCAGT 120 

GAATTATGAG TQGAGGGCAA CTTTTCTAAC CAGATTGTCT TTTCAGAACA AAGACCKGGG 180 

RATTGAGGAA GAGTTTGGAA AGAQGGAGAG GCAAGGAAAG AGAGCTTTAA ATTCAAAGGT 240

TAATTTCCTA AGAGGAACCT GGGCTGAATG ACTACflGTGT TATACCTTCX: AATCTTTC3CA 300 

GGTGGGCATG GAACACTC5CT TGTTATCACTC TGTOCACGGT ATAAATCCAT ATATCXZACAA 360 


AAACACACAT CCATCCATCA ACATATACAT GGTTTQGGAT GAGCAGGTCA ATAGmTGA 420 

GAGGGAGTTT GTTCCTTTTT TTTTCTCATT ATACTCTTAA ATTGTTGTCA GTTATCAAAC 480 

AAACAAACAG AAAAATTGTT TQGGAAAAAC CTTGCATACG CXTTTTCTAT CMAffrGCTTT 540

AAAATATAGA CTAAATACAC ACATCCTGCX: AGTTTTTTCT TACAGTGACA GTATCCTTAC 600 

CTGCCA3TTA ATATTAGCCT CGTATTTTTC TCACGTATAT TTACCTGTGA CTTGTATTTG 660 


TTATTTAAAC AGGAAAAAAA ACATTCAAAA AAAGAAAAAT TAACTOEAGC GCTTCATTAT 720 

ACTATTATAT TATTATTATT ATTGTGACAT TTTQGAATAC TGTGAAGTTT TATCTCTTGC 780 

ATATACTTTA TACGGAAGTA TTACGCCTTA AAAATACGAA AATAAATTTT ACAAGGTITC 840

TGTTTTGTGT GGAAGAGTAA TrGATGTTGC TAAGAATGAT GTTTGTrTTT TrGGGGrTTTT 900 

TGrrPGrrrrTT tttttaaatg. ttaccagcac tttttttgta agtttcactt tcxxsaggtat 960 


TOTACAAC3TT CACACTGTTT GTGAAGTTTG AATATGAAGG AATAATTAAA AAAAAAAAAA 1020 

AAACCNOGGG GGGGGCXXX3G TCCCATTQGN CCCAAGGGQG CGGTTACGGG GTCAOGGCCG 1080 

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1258 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

TGAATTGAGG GCTTAAAGAT AAACATATGG GRTTGGAGTT OTGTC3TCCAT AGGGTTTCAC 60 

TGCCTATTTG ATrTGAGPTTT ATCCCTATTA ATTTTTTACA GTGAAATITT ATTAAAGTAT 120

AATGTACATA TATTTTCAGT GGATTITGCT CTGAAGGTTC TCCAGTOffTC TGACTAOGAG 180 

ATAGTQCOGC TTCAGCTGTG GGiATAnGCA GGC3CAGGAGC GCTTCACCTC TATGACACGA 240 


TTGTATTATC GGGATGOCTC TGCCTGTGTT ATTATGTTTG ADGTTACCAA TC3CCACTACC 300 

TTCAQCAACA GCCAGAGGTG GAAACAGGAC CTAGACAGCA AGCTCACACT ACCCAATQGA 360 

GAGCOGGrroC CCTGCCTGCT CTTQQCCAAC AAGTCTGATC TGTOCCCTTG QCXAGTCAGC 420

COGGASCAGA TTGACOGGTT CAGTAAAGAG AACGGTTTCA CAGGTTOQAC AGAAACATCA 480 

GTCAAGGAGA ACAAAAATAT TAATGAGGCT ATGAGAGTCC TCATTGAAAA GATGATGAGA 540 


AATTCCACAG AAGATATCAT GTCTTTGrTCC ACCCAAGGGG ACTACATCAA TCTACAAACC 600 

AAGTCCTCCA GCTGGTOCTG CTGCTAGTAG TGTrTGGYTT ATTTTCCATC CCAGTTCTQG 660 

GAGGTCTTTT AAGTCTCTTC CCTrTGGTTG CCCACCTGAC MATTTTATTA AGTACATTTO 720

AATTGTCTCC TGACTACTGT CCAGTAAGGA GGCCCATTGT CACTTAGAAA AGACACCPQG 780 

AACCCAKGTG CATTTCTGCA TCTCCTGGAT TAGCCTTTSA CATGTTGCTG RCTCACATTA 840 


GTGCCAGTTA GTCCCTTCGG TGTAAGATCT TCTCATCAGC CCTCAATTTG TGATCCGGAA 900 

TTTTGTGAGA AGGATKAGAA ATCAGCACCT GCGTTTTAGA GATCATAATT CTCACCTACT 960 

TCTGAGCTTA TTTTTCCATT TGATATTCAT TGATATCATG ACTTCCAATT GAGAGGAAAA 1020

TGAGATCAAA TGTCATTTCC CAAATTTCTT GTAGGCOGTT GmTCAGATT CTTTCTGTCT 1080 

TGGAATGTAA ACATCTGATT CTGGAATGCA GAAGGAGGGG TCTGGGCATC TGTGGAmT 1140 


TGGCTACTAG AAGTGTCCCA GAACTCACTG TATTTTTGAA ACTTCTAAOG TCATAATTAA 1200 

GrrTTCTCTTG TCTTGGGCAT CAAGANTAGT TCCAATTTTT TGGGCCGGGG CAGQGTQG 1258 

(2) INFORMATION FOR SEQ ID NO: 221:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1693 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:

CACAATATAT GAAATAGTAC CCTCTAAAAA AGAGAAAAAA AAAATCAGGC GGrTCAAACTT 60 

AGAGCAACAT TGTCITArrA AAGCATAGTr TATTTCACTA GAAAAAATTT AATATCAAGG 120 

ACTATTACAT ACTTCAITAC TAGGAAGTTC TTTTEAAAAT GACACTTAAA ACAATCACTC 180 

AAAACTTGAT CCACATCACA CCCTOTTTAT TTTOCTEAAA CATCTTTGGAA GCCTAAGCIT 240 

CTGAGAATCA TGTGGCAAGT GTGATGQGCA GTAAAATACC AGAGAAGATG TTTACTAGCA 300 

ATTAAAGGCT GTTTGCACCT TTAAGGACCA GCTQGQCTCT AGTCATTCCT QGGGO&AGAG 360 

TQGCATTATC3 TTTTTACAAA ATAATGACAT ATGTCACATG TTTOCATCTT TCTTTGCTIt; 420 

TTGAAirrTr GAACAGCCAG TTGACCAATC ATAGAAAGTA TrACTOTCTT rcATATOC3rr 480 

TlTGCTrCAC TGGCTTAAGA GGTTTCTCAG AATATCTATG GCCACAGCAG CATACCACTT 540 

tccatcctaa TAGGAATGAA ATTAATTTTG TATCTACTGA TAACAGAATC TCGOTCACAT 600 

GAAAAAAAAT CATTTTATCC GTCTTTTAAG TATATGTTTA AAATAATAAT TTATOICTCT 660 

GCATATTQCA GAACAGCTCT GAGAGCAACA GTTTCCCA3T AACTCTTTCT GACCAATACT 720 

GCTGGCACCG TTGCTTCCTC TTTQGGAAGA GQAAAGGGTG TGTCAACATG GCTAACAATC 780 

TTCAAATACC CAAATTOPGA TAGCATAAAT AAACTATITA TTTTATGCCT CAGTATATTA 840 

TTATTTAATT TTTTAGCSrAA TQCCTATCTC TPGGTCTATT AAGGAAAGAA GCAATCACTA 900 

GAGAATTCAG GATAGTTTTG TTTAAATTCT TtXAGATTAC ATGTTTTTAC AGTOGCCK3C 960 


TATTGAGGAA AGGTAITCTT CYATACAACT TGTITTAACC TTTGAGAACA TTGACAGAAA 1020 

•PTATGCAATG GTTTGTTGAG AIACQGACTT GATQGTOCTG TTTAATCAGT TTCCTTCCAA 1080 

AGTGQCCTAC TCAAGAGGCC CTAAGACTCG TAGAAATTAA AAGGATTTCA AAAACTTTCT 1140 

ATTCXrrTTCT TAAACCTACC AGCAAACTAG GATTGTGATA GCAATGAA3G GTATCATCAA 1200 

GAAAGTITGA CCAAATTTGT TTTTTTGnTG TTGITGTTGT TTTGAATTTC AAATCATTCT 1260 

TATTOCCTTT AAGAATGTTT ATGTATGAGT GT6AAGATGC TAGCGAACCT ATCCTCAGAT 1320 

ATTCATCGTA AGTCTCCCTT CACXnCTTAC AGAGTTTCAG ATCGGTCACT GATAGTAaCT 1380 

ATTTCTTTAG TAAGAATGTG 1TAAAATTAC AATGATCTTT TAAAAAGAIG ATGCAGTTCT 1440 

GTATTTATTG TQCTGTGrPCT GGTOCTAAGT GGAGCCAATT AAACAAGm CATATOTATT 1500 

TTrCCAGTGT TGAATCTCAC ACACTGTACT TTGAAAATTT CCTKXATCX: TCAATAACGA 1560 

ATAGAAGAGG CXATATATAT TGCCTCCTTA TCCTTGAGAT TTCACTACCT TTATOTTAAA 1620 
AGTPGrPGTftT AATTCTTAAA ATCTGTCAAA GAAIAAAAAG TOGATTTAAA TTAAAAAAAA 1680
AAAAAAAAAA AAA 1693

(2) INFORMATION FOR SEQ ID NO: 222:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1196 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:

AOQCGTQGGT CGACCCACGC GTCX:OC3GACN T0C3CGTQGTG GGGAAGOGAG AAGGATTTGT 60

AAACOCXX3GA GCX5AGGTTCT GCTTACCCGA GOCCGCTGCT OK3CX3GflGAC CCCCGGGTGA 120 

AQCCACCGTC ATCATGTCTG AOCAGGAGGC AAAACCTTCA ACTGAQGACT TGGGGGATAA 180 

GAAGGAAGGT GAATATATTA AACTCAAAGT CSOTOGACAG GATAGCAGTG AGATTCACTT 240

CAAAGTGAAA ATGACAACAC ATCTCAAGAA ACTCAAAGAA TCATACTGTC AAAGACAGGG 300 

TGTTOCAATG AATTCACTCA GGTTTCTCTT TGAGGGTCAG AGAATTGCTG ATAATCATAC 360 


TCCAAAAGAA CTQGGAATGG AGGAAGAAGA TGTGATTGAA GTTTATCAGG AACAAAOGGG 420 

GGGTCATTCA ACAGrTTTAGA TATTCTmT ATTTTTTTTC TTTTCCCTCA ATCCTTTTTT 480 

ATTTTCAAAA ATAGTTCTTT TGTAATGTOG TGTTCAAAAC GGAATTGAAA ACTOGCAOX 540

CATCTCTTTG AAACATCPGG TAATTTGAAT TCTAGTGCTC ATTATTCATT ATTGTTTGTT 600 

TTCATTGTGC TGATTTTTGG TGATCAAGCC TCAGTCOCCT TCATATEAOC CTCTCCrTTT 660 


TAAAAATTAC GTOTQCACAG AGAGGTCACC TTTTTCAtSGA CATTOCATTT TCAGGCTTCT 720 

GGTGATAAAT AAGATCGACC AATGCAAGTG TTCATAATGA CTTTCCAATT GGCOCTGATG 780 

TPCTAGCATG TGATTACTTC ACTCCTQGAC TGTGACTTTC AGTQGGAGAT GGAAGTITTT 840

CAGAGAACTG AACICTGGAA AAATGACCTT TCCTTAACTT GAAGCTACIT TTAAAATTTG 900 

AQQGTCTGGA CCAAAAGAAG AGGAATATCA GGTTGAAGTC AAGATGACAG ATAAGCTGAG 960 

AGTAATOACT AACTCCAAAG ATGGCTTCAC T6AAGAAAAG GCATTTTAAG AmTTTAAA 1020 

AATCTTGrrCA GAAGATCCCA GAAAAGTTCT AATTTTCATT AGCAATTAAT AAAGCTATAC 1080 

ATGCAGAAAT GAATACAACA GAACACTGCT CTTTTTGATT TTATTTGrCAC TTTTTGGCCT 1140

GGGATATGGG TTTTAAATGG ACATTGTCTG TACCAGCTTC ATTAAAATAA ACAATA 1196 
(2) INFORMATION FOR SEQ ID NO: 223:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1791 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:

TCftGGGAGGT GGCAGGAAAG OCTTOGAACA GCTOCXXX3AG TGACGQAfiC3G GCGGCCOCOC 60 

CCGGnroCGC TGGAGGTPCGA AGCTTCCAGG TAG0GGCXXX3 CAGAGCCTGA CCCflGGCTTCT 120 

GGACATOCTG AGCCCAAGTC CXXXACACTC AGTQCAGTCA TGAGTCOQGA AGTGAAGGTG 180 

ACaCGCXAGA AOCAQGfiGCA ATTTCTOCTC CTAGCCAACTr CGGCCAAGGG GQCAO0QCT6 240 

GCCACACrCA TOCATCAGGT QCTQGAGCXX: CCTGGTGTCT ACGronTGG AGAACTGCTG 300 

GftCATGCCCA ATGTTAGAGA GCTQQCTGAG AGTGACTTTG CXTTCTACCTT OCXSQCTQCTC 360 

ACAGTOTTTG CTTATGGGAC ATACGCTGAC TACTTAGCTG AAQCXX:GGAA TCTTCCTCCA 420 

CTAACAGAGG CTCAGAAGAA TAAGCTTCGA CaCCTCTCAG TTGTCACCCT GQCTGCTAAA 480 

<?EAAflGTCTA TCCCATATGC AGrPGTTGCTG GAGGTCTTGC CCTGCGTAAT GTOCXSGCftGC 540 

TQGAAGACCT TGTGATTGftG GCTOTGrEATG CTGAOGrTGCT TCCSTQGCTCX: CTC3GACCAGC 600 

C3CAAOCAGCG GCTOGAGGTT GACTACAQCA TCX3QQCGGGA CATCCAGCGC CAQGACCTCA 660 

GTOCCATTGC CCGAACCCTG CflGGAATOGT GTGTOGGCTG TRAGGTCGTG CTGTCAQGCA 720 

TTGAGGfiGCA GGrTGAGCOGT GOCAACCAAC ACAAC3GAGCA QCAGCTX3GGC CTGAAGCAQC 780 

AGATTGAGAG T6AGGTTGCC AACCTTAAAA AAACCATTAA AGTTAOQACG GCAGCflQCAG 840 

COGCAGCCAC ATCTCAQGAC CCTGAQCAAC AOCTGACTGA GCTGAQGGAA CCAGCTOCTG 900 

GCACXaACCA QCXXXasCCA GCAAGAAAQC CTCAAAQGGC AAGGGGCTCC GftGGGAGOGC 960 

CAflGATTPOG TCCAAGTCGA ATTGAAM31A CTOICGTI'lt: CTCCCTGGQG ATGTGGGGTC 1020 

CCAQCTGCCT GCCTGCCTCT TAGGAGTCCT CAGAGAGCCT TCTGTGCCCC TGGCCAGCTG 1080 

ATAATCCTAG GTTCATGACC CnTCACCTCC CCTAACOCX3V AACATAGATC ACACCrrCPC 1140 

TAGGGAGGAG KCAAATGTAG GTCATGnTT TGTTGGTACT TTCTGTTTTT TGrTGACTTCA 1200 

TGTGTPCCAT TGCTCCCCQC TGCCATGCTC TCTCCXmCT TTCCTTAAGA QC7PCAGCATC 1260 

TGraXTCTT CATTACATGT CATTGAGTAG aPOGGTEAGCX: CTGATGGGGG TCGCTCTGTC 1320 

TQGAGCATAA CCX^CAGGCG TTTTTTCTGC CAOCCCATCC CTGCATOCCT GATCCCCAGT 1380 

TCCTATACCC TACCXTTQAC CTATTQAGCA GCCTCTGAAG AGOCATAGGG CCCXXACCTT 1440 

TACTCACACC CTGAGAATTC TGGGAGCCAG TCTGCCATGC CAGGACTCAC TGOACATGIT 1500 






CftTCCTAGAA TCCTGTCACA CTACAGTCAT TTCTnTCCT CTCTCTQGCC CTTGGGrrCCT 1560 

GGGAATOCTG CTGCTTTCAAC CCCAGAQOCT AAfiAATCSGCA GCCCTTTCTT AACATOITCA 1620 

GAGATGATTC TTTCTTGCXX: CTGGCCATCT CGOGAAGCIT GATOGCAATC CTOGAAGGCJT 1680 

TTAATCTCCT TTTGrTGAGTr TGGTQGGGAA GGGAAGGC3TA TATAGATTGT ATTAAAAAAA 1740 

AAAAQC3TATA TATGCATATA TCTATATATA ATATGAOSCA GAAATAAA3C T 1791 



(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2517 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

ACACTAGTQG ATCCAAAGAA TTCGGCACAG CGGCACAGCA TPGTTGAGCT mViXJltrm 60

TGTGQQGCCC TCAAGCGAGC TCGACTGGTC CATCCTQGGG TAGCX5ASGTC GTCTrTCTOA 120 

AAAAGGACGA TGCCATCACC GCATAYAAGA AGTACAACAA CCXXTTGrCTG GACQGGCflGC 180 

CXSVTGAAGTG CAAOCTTCAC ATGAATGQGA ATGTTATCAC CTCAGAOCAG CCCATCCTCC 240 

TGCGGCTGAG TGACAGCXXIA TCAATGAAAA AGGAGAGCGA GCTQCCTCGC AGGGTCAACT 300 

CTGCCTCCTC CICCAACCCC CCTQCXrGAAG TGGACCXZTGA CACCATCCTG AAC3GCACTCT 360 

TCAAGTGCTC AGGGGCCTCT KTCSACCACGC AGCCCACAGA WTTTCAAAATC AAGCTTTGAG 420 

CAGGGGAGTR AGGCAGCCAG AftGTGGGGGC AGACX3AQGGT GGCrrcTGTTT CCCCAAQGCA 480 

AAGCTTATGA CXMTGGGCC ATCGC3ACTQG AGACCCCTGA TTGTGGGAAG GGTTCXrAGG 540 

GATAAAGAGC TKXTCACTG GATQQGACCC GCXTTTTCrGT GTTGTGTTCT GCCCTGTGCT 600

CTTCTCTCTA CX3TTAACGTT TCXTTOTAGTA TGTTTCTTCA TCTCATCX3CX: AAGGTAGGCT 660 

TGnOTITTTM AGTGTGTGCC TCCOCGAGCC TCAGCCCXZAA GCTGATITCT TATCTQGAAA 720 

TGGTACACTG AATTCTCIOG GTGGCTTTCT TGTGGCXXXIA TC3C3GATGCAG CX3TOGGGGCT 780

GTCTGAAGGA CCCTGCTTTT TCCAGQGGCC GAGGGGCTGC CPTrCCTTTG TGrCTTATTAA 840 

GCTTTTCAAA CAATGGAGGG GATGGAGAGC CCTGGTGTCC TGADQGGAQC CAGGTCGGCC 900 

TGAGAGCTGT GCXXCTCCTC TGrTCTTGTCA GrrGQAGGTGC CTC5GGTQQQG AGCAGGTCTC 960 

AGGCCTCTTG TCCTCTCCCC AGrTGGCTCCA GC3CXTCACTA OTGGCAAGGG CAGGATCAGG 1020 

CTGCACCGCT GGGAAGAGTC TATCTAAGCT CTTGGCTTGG AGTCCXX?rGT OGTCTCCRCC 1080 

CAGAGCSAAGT TCTCCAGAGT TCAOCTTTCC CTTTTCCTTG AGTTGTGCTG AATQCCCCAC 1140

CXX:AGCTCTC TTTCCCTTCT GGGrrGTCriT GCTQGGAGGG GGCTGTGnTC TCAGCOCTCC 1200 

CGGTTCTCAC CTCGCCTQGC ACTTAACCAC ACCCTGCTTTT TGTOTAaXG CCAGCTCPCT 1260 

TCTQGTTGGG CCTTTGAAAG GCTCAGOCTC CCATTGTOCA GTGCTTGGGT TTGGAGCTTA 1320 

TTTGAATOGA AGAGGTCAGT TTGITCCTGG CTCTCGAOTT CTGGCCTCAG TTGTCTACAG 1380 

GACAGrTGGTC AGGGATXXCT QGAGGCATAT ATCCAGCTQC CACCAAGt3GG aVCTGTTTGT 1440 

TCCCACTTAT GTCACTTGACC CCATCCATCC ATGACCAGAG GATTATTTTC CPGOCnGGC 1500 

AGAQGAGGAG GftGTCAAGGG AGCAGQGCAG CTCTACCAGG CAAGGTOTTT CCXX:AGCATA 1560 

GGCGCAGACA GTTGQGACGA AACTTCAGAG CCCAGGCAGT OCCTGAATGA CCAGGCCAGT 1620 

GTTGTCACTG AGTOGTCCCC TQCTGGTTGG GAGTCAAGAG AATCCAGGCT GGCAGAGCTC 1680 

GAGCCAGTTG GGGAGCACGG TTCTQGGAGC TCTGCAAAAT CAGTAGCAAG TGCTQGAAAA 1740 

GGCACATGCX: GAAGATACTC AAGAGCTCCC AAGAITTGCT TGAGGCTAGC CCAGTCAAAA 1800 

AAACCAGAGA CTCATGTTTC CAGQGGTCAG TCTGTCAGGC AGGAAGGACC CAGGATTTGA 1860 

ACCCAGCTTC AGTCTOCAGG CTCT6AGQCT GCCCAGGACG GGAAAGTCCA AGGAAQGGGC 1920 

CTGGTGGrrGC TCCACTTGCA CTTCTTTAAA GAATGCTGCT TTTTATTCTC CTAACCXnTT 1980 

CAAGTGGGTO CAGACTTCTC GITAGCAGCT GGAAGACATT CCTCCCACAC TTTTCCCTTC 2040 

CTGGCCCAAG AGAGCAITCCA GAAGGCAGTA GGACCTQGTT TTTCAGCTAC TGQGAGCCGG 2100 

GGGCTCACTG CTTGCACTGT GCTTAGGGTA GGGATGGTAA ATATCCTCCC TGCATQGCTT 2160 

TATCCrcCCT CTCATCCCAA AGCAGGTATC TTCTGGTTGT CACACSAGrTTT CATTGAGTCC 2220 

AGCTGCAGCC ACGTOGCCAT CTQGAGCTGG TGCTATAGGT GACCATCTOG TACATTGAGG 2280 

GGACCTGTTT GCCTCCTCCA CTCTATAAGC AGTCATCTTG GQAGACCXX3G AGGAGAAGGT 2340 

GGTQGGCTAG TCCTGTCTCC TCCTCCACTT CXXATGCCTC TATGTTACCC ATCTGTGrPCT 2400 

CCTGTCCAGA AGGAGAGGAA GGGGCATTAA GAGATGAAGG GTGATTATGrT ATTACTTATC 2460 

CATTTCTGAA TAAACATTTG TTAITCCTAA AAAAAAAAAA AAAAAACTCG AGGGGGG 2517

(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2424 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

TTCTANCTAA TOGAGGATTG ATPCTAATGA CAGAGTCTTT CAACACTTTG CACATCATOT 60

A3X:A0GAAGC TACAGCTTGC CATGrrGACTG GftGATTTAGT AGAACTTCTG TCAATATTTC 120

TTTCGGTTTT GAAGTCTACA CGCXXTTTATC TTCAGAGAAA AGATGTGAAA CAAGCATTAA 180

TCCAC3TQC5CA GGAGOGAATT GAA1TPQCCC ATAAACTGTT AACTCTTCTT AATTCCTATA 240

GTCCTCCAGA ACTTAGAAAT GCTTGTATAG ATGTCCTCAA GGAACTTCTA CTTTTCASTC 300

CXXIATGATTT TYTTCATACT CTOGTTOCCT TTCEACAACA CAACCATTGT ACTTACCATC 360 

ACAGTAATAT ACCAATGrTCT CTPGGAOCTT ATTTCCCTTG TCRAGAAAAT ATCAAGCTAA 420 

TAGGAGGGAA AAC5CAATATT CX3GC3CTCCGC GCCCTGAACT CAATATCTGC CTCTTGCTCA 480 

CAATGC3TGGA AACCAGTAAG GQCAAAGATG ACGTTTATGA TOGTATQCTG CTAGACTACT 540 

20 TCrTTTCTTA TCATCAC?ITC ATCCATCTAT TATQCXX3AGT TGCAATCAAC TGTCAAAAAT 600 

TTACTGAAAC ATTAGTTAAG CTGAGTGrTCC TAGTTGCCTA TGAAGGTTTG CTACTTCATC 660 

TTC3CACTGTT CCCCAAACTT TQQACTGAGC TATGCCAGAC TCAGOXrTQCT ATGTCAAAAA 720 

ACTGCATCAA GCTTTTGTGT GAAGATCCTG TTTTOGCAGA ATAmTTAAA TGTATCCTAA 780 

TQGATGAAAG AACTTTTTTA AACAACAACA TTGTCTACAC GTTCATGACA CATTTCCTTC 840 

30 TAAAGGTTCA AAGTCAAGTG TTTTCTGAAG CAAACTGTGC CAATTTGATC AGCACTCTTA 900 

TTACAAACTT GATAAGCCAG TATCAGAACC TACAGTCTGA TTTCTCCAAC CGAGTTGAAA 960 

TTTCCAAAGC AAGrPQCTTCT TTAAATQGGG ACCTGAGQGC ACTCGCTTTG CTCCICTCAG 1020 

TWZACACTCC CAAACAGTTA AACCCAGCTC TAATTCCAAC TCTGCAAGAG CTTTTAAGCA 1080 

AATGCAGGAC TTGTCTOCAA CAGAGAAACT CACTCCAAGA GCAAGAAGCX: AAAGAAAGAA 1140 

40 AAACTAAAGA TGATGAAGGA GCAACTCCCA TTAAAAGGCG GCX3ICTTAGC AGTGATGAGG 1200 

AGCACACTGT AGACAGCTGC ATCAGTGACA TGAAAACAGA AACCAGGGAG GTCCTGACCX: 1260 

CAACGAGCAC TTCTGACAAT GAGACCAGAG ACTCCTCAAT TAOTGATCCA GGAACTGAGC 1320 


AAGATCTTCC TTCCCCTSAA AATAGTTCTG TTAAAGAATA CXX5AATGGAA GTTOCATCTT 1380 

CX5TTTTCA6A AGACATGTCA AATATCAGGT CACAGCATGC AGAAGAACAG TCCAACAATC 1440 

50 GTAGATATGA CGArTGTAAA GAATTTAAAG ACCTCCACTG TTCCAAGGAT TCTACCCTAG 1500 

CCGAGGAAGA ATCIGAGrTTC CCTTCTACTT CTATCTCTGC AGTTCTGTCT GAdTAGCTG 1560 

ACTTGAGAAG CTGTGATQGC CAAGCTTTGC OCTOCCAGGA CXXTTGAGGnT GCTTTATCTC 1620 

TCAGTICTOG CCATTCCAGA GGACTCTTTA GTCATA1GCA GCAACATGAC ATTTTAGATA 1680 

CCCTGTGrTAG GACCATTGAA TCTACAATCC ATGTCGTCAC AAGGGATATC TOGCAAAGGA 1740 

AACCAAGCTG CTTCTTGACA TTAGGTGrPAG CATGTCTACT TITAAGTCCC TCACCOXAA 1800

CCCCCATGCT GTTIGTATAA GTTTTGCTTA TTTGTTTrTG TGCTTCAGTT TOTCCAGrTGC 1860

TCTCTGCTTG AATGGCAAGA TAGATTTATA GGCTTAATTC TTGGTCAGGC AGAACTCCAG 1920 


ATGAAAAAAA CTTGCATCTT CAGTATACTT CCTAAAGGQC AATCAGMAA TQCSMATGnT 1980 

TTAICTAATT AAGAGTTCAC TTTAGTOCSCT TTCATTTAAT ATGQCTGrPCT QQGAAGAACA 2040 

10 GQGTTQCCTA GCCCTGTACA ATGTAATTTA AACTTACAGC ATTTTTACTG TGTATCATAT 2100 

GGrPGTCCTCT GTQCCAGTTT TOTACCTTAT AGAGQCAGAT TOCCTCOC»T OGCTGrTGGTT 2160 

CTTATrArCA AAATTAAGTT TACTTGrTATA OGGAACAACC ACAAGAAATT TGATTCTGTA 2220 


AAGAATCCTC TTTAGCTGTG GCXnGGCAGT ATATAAATQG TQCTTTAITr AACAGAATAC 2280 

CTGTGGAGGA AATAAAGCAC ACTTCATGTA AAAATAAOTG TTTTATTTTT ATTC3ACATGA 2340 

20 CTGATTGATT GCTATTCTGT GCACTNAATT AAACTGATTG TGATQACTTA AAAAAAAAAA 2400 

AAAAAAAAAA AAAAAAAAAA AAAA 2424 






(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

ATATAGGACG GATAATCTGT TTACATTCTG TTCTTCTCGA TGCACTCACA AGCGGGTAAC 60 

TAGGTGACAA GAAAACAAAG ATCTTATTCA AAAGAGGTCT TACAQCAACC CAAOSTCTCA 120 


TCrrCCCATA GTAAAGATGA CGGCGCCTTG AGGTrAAGCTA CAGGCAACAC CACTTCOGOG 180 

TTTCTCTTGC GCCCTGGTCC AAGATGGCGG ATGAAGCCAC GCGACGrTGTT GTGTCTGAGA 240 

45 TCCCX3GTGCT GAAGACTAAC GCCGGACXXX: GAGATCGTGA GTTGTGGGTG CAGCX5ACTGA 300 

AGGAGGAATA TCAGTCCCTT ATCXX3GTATG TQGAGAACAA CAAGAATGCT GACAACGATT 360 

GGTTCCXaCT GGAGTCCAAC AAGGAAGGAA CTCGGTGGTT TGGAAAATGC TGGTATATCC 420 


ATGACCTCCT GAAATATGAG TTrGACATCG AGrTTTGACAT TCCTATCACA TATCXTTACTA 480 

CTGCXXXAGA AATTGCAGTT CCTGAGCTGG ATGGAAAGAC AGCAAAGATG TACAGGGGTC 540 

55 GCAAAATATG CCTGACGGAT CATTTCAAAC CTTTGTGGGC CAGQAATCTO CCCAAATTTG 600 

GACTAGCTCA TCTCATGGCT CTGGGGCTGG GTCCATGGCT QGCAGTGGAA ATCCCTGATC 660 

TGATTCAGAA GGGCGTCATC CAACACAAAG AGAAATGCAA CCAATGAAGA ATCAAGCX:AC 720

TGAGC3CAGGG CAGAGGGACC TTTGATAGGC TAOGATACTA TTTTCCTCTG CATCACACTT 780

AACTCATCTA ACTGCTTCCC CGGACACCCT CCAOCTCTAG TTGrTACTAA CTTAGCTGCAG 840 

TAGGCATTGC TGGOGAAGAA ACAAACACAC ACCAAACAGT ACTGCTACTT AGTTTCTAAG 900 

GCTC5CACAGG GAAGGGAAAG ACTGGGCTTT GGACAATCTA GAGGTAATTT ATATCCXXXX: 960 

CCAGGTOGAG CAACATGCGA TTCTGGAGGC ACGGGGGTAA CTGAAAGrTGA CTACATATAG 1020 

TCTTTCTCC3T TTCTQGAGAT AACCCATCAA TAAAAGCTGC TTCCTCTGOG TAAAAAAAAG 1080 

(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

TTC3CATTCAC AATTACTQGG AGGCAGGCAG GQGCAGTTGC ATGCTGGC3GG TC3GCTGCATO 60 

GSCTGCCASC TCTCCTGGGT TTGAAGGATG CGGTACASCT GCTTCAGCTG AGCAACGATG 120 

30 TTATOCTTGA TGTCTQGGGT TGAGATCTGC AGGCGGACAC TGCCACTATC AAAGC3ATCGT 180 

GTGAAATCAC CAGAAAACAT CTCCTAGATC ATCOGAGCCA CTACTGGAAT GACCTGAACC 240 

AAGATGAGTT TCCTTTCCAA TGGTTTCCCA TCTOGCCATr CTTCCCCAAA GCATAAGTAG 300 


ATCTCAAACG GTGGCTGCTT CPCTATCTGT CCTTTCTGGT GGGCAATGAG ATCQCTAAOG 360 

AATGTTTCCA GACAAAATA6 CTTGACCTTC TTTTGTCTCT CAATCAGGTT GGGAGCAACA 420 

40 ACTGATGGGG CACATGGCXX: AGACCAGTAC ACCTTQCACT QGCACAGyCT GATGGCATAA 480 

ATC3GCATGAC CGCTGAOCTC CAGGATCAGT CCTCTGTCCA TGACGTCCAG CAGCTTGCTA 540 

GTGAACAGCT TCTGCTTCTC ATTGGTAATA TC3CTCAGGAC CrOGGAATTT GACCTGCTCC 600 


AGNCTGACGG GACCAAAGAG CTCCTCCTGG TCAGGCATGG GACCCAGGTC CCCATAGAAG 660 

ACrrCGGCAGC CCTGAGGGTT GCTCACGGTC ATGGTCCTGC OCGTACTCCT TCCCACGGTA 720 

50 CTGAAACTTG ATGTCCAGGT CAGTCATTGG GAGAGAGCTG ATCCACAGTT CTGGAGAGCT 780 

ATAGAAGGRC TGTATAGGTG CCTGGGGWAC TTCCATCTCC AGGGGTTCAG TTTTGGGCCA 840 

CACTGCCTCC GGSCTGCAGT TGCCCACACT GCAAITTGCCC ACACTGGCTG GCGCCATQGG 900 


AGAACCATTG ATGTTCAGGA AOQGGAAGGT GTCCTQGATG GGAACATGGT GCTGCGACTG 960 

ATCCAGCTCA TCTTCCTCAT CTTCTTCATC CACATCATTA TCCTTCTCAT CCCAGGGAGC 1020 

AGACCCTGTG GATCXTOGGT TAATGATCGA SCCCTGGGGC TGAGGGATCTT CACACACTTG 1080

ATATATCnC ACPGOGITCA TGGGCACCTC CXITCGTGCC ATCCATACAT CCAGGTTGAA 1140

TTCTCTGCTC TTATTGftGAG CACflGOGCAG CTGOGCCTTC CATTTAGCTG GGTCAGGGTC 1200 


ATCCACOCCT TCXTIGGrrACT TCCCTGTCTC TACAGCCCAG GCCTTAAAAA TGGTATTTTC 1260 

CTCTTCTTGT TGAOOGCTAT GCCGGGTOGC AKSTTTOCAG GGAATPCTOGA AGCGrTTTAGA 1320 

GTCCCTGTGT AGCCAG 1336



(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2043 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

25 TCAGCTGGTC CCTTCCTTGT GTCCTGGGGG ACCTGCTQGC GGCCTCTTOC TGGGAGCCAT 60 

GACCTCAGAC CCCACCCACA CTCCAGATCG AGAOCCCTGC CTCCCCCCGG CAAATGTCCT 120 

CCCGCTGCCT TGCAGCCTGC ACTTTGCACA TGCTCACCCC CAGCACAGTC CCACTGGCCC 180 


CTCAMCTCCC CTTCCCTGAG CTCCTTCCCA AGGACTCCTG GTCACTGCCT GCTCTGCAKT 240 

CAGAGGCCCA GGGTCCAGCA GCCCGG9QGG AACGGGTGCT GCCTSTTCCT CCAGTTAGCT 300 

35 CCAGOTCflGG TCTGAGACCC GTCYTGAGTA AAGGrTCTGAG CAMDGACCGT GCCCTCTC3CC 360 

CAGGGCTC3GG TCCTGAGCAG CTGGTTTTCC TGCAGGAAGG TTGGAGCAAG CAAflGTCCTT 420 

CTCTGCCCTC AGGGTCflGCT GCCCAGACTG GOGCGGMTOC AGAGAGGCftG GTGGGCTGTG 480 


GCTGGACTGG TCOQGAGCTC GCTTCCTTAC CAGAAAAGCC TCAGCCTTCC TCTGGAAGCA 540 

TCCOXGTTC TGGGCAAGGG GGAAGGGCTC CTTTAAGGGG TGTGCTTTCC CAGTQGGGAG 600 

45 CAGTCTGGCC CTGCCCCCTA CTAAAGCCTC TGCTCTCAGC ACTTTCCCCC AAGrrCCTTGT 660 

AACTTGCTTG AAGGrrGGGTT CTGGCTGCCA GCCAGTCCCT GGACAAACTC TCCTQCCCCT 720 

ITTAAATTTC ACTCATTTTG TATAAACXXA QCAGGCTQGT CSTTTTACTrAG CCCIOTAGCT 780 


TTITrCATTT TTTCTTTCCG TCTTTCTTCT TGAGTTCAOG GTTCAATATT GCCTCCTCGC 840 

CCTGCJrGAGG GGAGGTGCTG CTTTTCTCSCC CCACCTGCCG GCTQGTTCCA GCAGCGCTQG 900 

55 NQCCCAGCTG GGGGGCCX3GG ATQGGGGCTT CTCTCTCTGG GftGGGGTCSCA GGTC3CCCTCC 960 

CCAGGCTQQG AGGGTTCCTT CCCTAGCTCC CCATCTGCCC CCGCTGGrrGA GftGTTQGGCT 1020 

TCTTOGTCTT GGAACTCCCT GGCATTQGGA ACAGAGCATT TCCAGCATTT GrrTGTTGTTG 1080

TTTTACTCAC CTAACCCTTA QAAAATGAAT GTTAGAAOGT GCCTQCCGAG GOGGGACAGA 1140

GroTTTGCTC GCXSCTOGAGA AGGCTCTGCT CAGCCCTGAG ASTOCCTTCX: TOCCCCACOG 1200 

ATACTTGGCAC TITAAAAAGG AAGCTGACCG CACAGTGTCC AGACGAATTG GCXXXXZAGAA 1260 

GATGGGGAGT TCTGTCCTGC CCTTCTGTGT CTGOGTGACC TCAOCCAGOC TAQGAGGGAG 1320 

GTOCATTCAG GGTAGATTTG CCTCICATTC AAAGTTCTGG GQCTTTQOGy GGAAAACAGC 1380 

CAC5CTTTGGC GCTGTTQGGG AGACTCCTCC AGAOCAQGAA OCCCAGAAGG AGAC3«3AGCX: 1440 

TGCCACATCC TCCCACGCCA GGCCCTGGGC CAGGGTGATT GGACTGAGAA TTTGGCCACA 1500 

AOCTyUVTTGA TQCTGGCTGG AACCAGAGQC CAGAAAGCCT GGCCTTGTOC CCATGTGGCyV 1560 

GCCCICTCCT CAGCXTTCTT GTCOCCTTGA GCTCAGTGAA TTCCCACCAG GTCCCCACAG 1620 

CTCCTGGACT TCAAATTCTA TATATTGAGA GAGTTGGAGA GTATATCAGA GATATTTTTG 1680 

GAAAGGAGTr GGTrCTATGCA ATGTCAGTTT GGAATCTTCT TGAAAGTTTA ATGTnTTAT 1740 

TAGGAGATTT AAAGAAAATA AAGGTCTACA ATATCnTAG cjrrrmTlT TTTCCTGnT 1800 

ACCGCACAAA CTGACCACAT GGCATGTCTA TCAGGATGGA GGGTGrTCCAT OnitrrCCrCT 1860 

GTCTITAGGG AGGTCATAAG GAGATGGSCG RAGGGGTOTT Tm ' Il.Trm ACTCCCCTCC 1920 

TTTCTAACAG AATGITGCCA CCACTGCTTG AGTGGGCTGT C?rTTGTTCCT CTGTCCCAGC 1980 

TTCTGTTGrrA GAAAATAACA TICTTAGGGG AACTCAGGCT AGrPGTCAGCG TCTTGGTTTG 2040 

GGG 2043

(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 540 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

TAAAAAGAAG CGQGAGAATC TGGGCGTCGC TCTAGAGATC GATGGGCTAG AGGAGAAGCT 60 

GTCCCACTGT CGGAGAGACC TGGAGGCCGT GAACTCCAGA CTCCACAGCC GGGAGCTGAG 120 

CCCAGAGGCC AGGAGGTCCC TGGAGAAGGA GAAAAACAGC CTAATGAACA AAGCCTCCAA 180 

CTACGAGAAG GAACTGAAGT TTCITCGGCA AGAGAACCGG AAGAACATGC TGCTCTCTGT 240 

GGCCATCTTT ATCCTCCTGA CGCTCGTCTA TGCCTACTGG ACCATGTGAG CCTGGCACTT 300 

CCCCACAACC AGCACAGGCT TCCACTTGGC CCCTTGGTCA GGATCAAGCA GGCACTTCAA 360 

GCCTCAATAG GACCAAGGTG CTGGGGTGTT CCCCTCOCAA CCTAGTGTTC AAGCATGGCT 420

TCCTQQOGGC CX:AGGCCTTG CCTCCCTGGC CTGCTGGCXJL; GTTCCGGGTC TCCAGAAGGA 480

CATQGTGCTG C?rCCCTCCCT TAGCXCAAGG GAGAGGCAWT AAAGACACAA AGCTGGAAAT 540 
(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 448 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

AATTGrrGAAA TATTAGAATA TTGTTACTAT TTGACCCAAC TCAAAATCTC CATGGGAAAA 60 

TACCTGrrCGA TACCCACAGT ATTGTTGAAA ATAATCAGAT GCAGTATCAC AGCTGTC?rCA 120 

GACTCTAGTA CCAGTTGCSGC AATCAAGGCA CAGCTAAAAA TTGAAAACAA AGATCTGGAC 180 

25 AACAAAACAG CCAAAGGTQG GQGTCAAGAA GCTCTC3ACGT GTACCTAGCT GTAGAATQCT 240 

ATGCACACCT GCCAGGTGTA GTGTQCATAT CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT 300 

CAMCTCTGGT TGACCATGAG CTCTGTGTAA GCAGGAAGTG AAGQCTAAGG CAGATTTAAG 360 


CTCTGAAAGC ATTCCACAAC ATACACACAA ATCGTGCAAA GCATTAAGGA AATCTTGTTA 420 

CTGCTAAGTG TTGCTGACCC AGGAACAA 448 


(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:
GTATGCroCC CCAAACCAAT ATGTGTGGCT GCCmWACC TGACTTCTCC AACATGTAGC 60 
CCCAAGAGGA GGCCTCTAGA CTRAGGGAOS GGCTGGTGAC CCAGGTGrTGG TGGQGCTGCA 120
TGARACTACC AGAGAGACAG ACATTCTOGA ACTCACCCTG GGGGATCCAG TGGATCTGCC 180
TATGCTTCTGG TCCACCCCAG ACCTGTGAGA TGTTCCTCAT GAGGATGCAC TTCTGCTTCT 240
GCAAGTATTG CTQCAGCTTC ATAGTGACTC CCACCAGCAC CAGCAATACA GYTAGCTACC 300
TGTGGCCrrG GATCTCAGCC AGCATQGCTG GGAGAGGGAG CARCTGGGCA TGTACCCTAA 360
ATGCTGTTAC CAGGGAAGGA CTCCCAGAGT GAAGACAAGT AGGGACT 407

(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:
GTATTTGATT TCAGQCTOCT AAATGGGCTC ATTTAGCATT CATTCXTTTGA TCTAGACATT
AAAAAAAAAA CTGAATAGCA TTCTTTCCAG GOTAACTAAT AAAQCAGftCA TGCTAAQCCT 
ATAAATACAT CAGCACTCXA GCACACGTTT AAGGTTQCCA CQGACAAGGA TCACACAATA 
GAGAACACTG TAGTT0QC3TC TGCTCACAAG ACCCAGAACA TTC3ATCAGTT TrrcrPTOITC 
GTTTATTATT TTTCTGTTAA AAAATTGTCA AAAGTTTCTT TTAGCTAGAT GATATTTTAA 
TAGCTGCX3AG TQCTITOGAA CTATAAAGAT GTCACTACTT AACACACATA CCTTATOTTT 
TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAQGAGAAG TTAGCCAACC ATCTAGCATT 
TAGAATCCTC TTrTTTATTG TCTTCTAAGG ATATGGATGT TCCCATAACA GCAACAAAAC 
AGCAACAAAA ACATTTCATA AATATCACIT GATAGACTCT AAGCACCTGC TTAACTTTCT 
GTOCCAAATA TTTAGTGTGT ATATATATAT ATATATACAC ACACACACAC ATATATATTC 
AACAAATAAA GCAAAATATA ACATGCATTT CACATTTTGT CTTTCCCTGT TACGATTTTA 
ATAGCAGAAC TGTATGACAA GTTTAGGTGA TCCTAGCATA TGTTAAATTC AAAITAATGT 
AAAACAGATT AACAACAACA AAGAAACTGT CTATTTGAGT GAAGTCATGC TTTCTATTAT 
AATAACTTGG CTTCX3GTTAT CCATCAAATG CACACTTATA CTGTTATCTG 



(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 932 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:
CCAGAAGAAA GACCAATCTA GAATATC3GAA CTCTAATCAC TTCTAGTATT TCAACTTCCT 
AGCAGAAATG AACTTQGCCC TAGACCTAGG GGATAAGCAA TGTTCTTTAT GTAGCCAATG 



CTAOGGAAAC AAAAGAGGTG AAAGAGACCC TTTTTTTATA CTTAATGTAC ATATATTCAC 






im-iUftGCA AGAATGCCAG AAATAQCCTT CATTTCTACC CTGCAAAATA ATCCAGATCT 
GCTTTCTAAA TTTCTAAAGT GAAACATOCA ATAnTATOC TCTGACTGAC 

TOCTGAATTG GARGAGGAAG RACTTCTGTT TACAGAAAAC YGTATTGriTA TATATCTCAG 
GCTGTOrATT GTGACTATCA GCA3TCTGGT GCAAATGAAC TTTTCTCCAT CATOGACTOT 
GCSAAAATPGA TACTTTTAAA GCATATTCTT CTTATGAGCAC AGGTCCTCCT AGTCAAACTT 
AATTTGACAA AGGGTCTCAT ATGCOTTCCT AACCTGAWTT GTATTAACAr TCACAGAGCC 
TACATTTTCT CATTAGGGTT KPQATOCTCA GTATCTITCC AAGTCCX:AGG CA£a«XnTOC 
CTTTTCTGAT CAAACATACC AITTTTTGTA TTTCACAACT ATAGACAGTC ACTTCTGCAG 
TOCXIAATTTA AAAATGCAGA ACTGCTTTAT CXAAGAATGC TGAAAAATAC TGTTCTATCC 
AGGTTTCCTA AACTATAAAA GCAGATTTTG CTTTTGTTTG TTAATCATAG GCATOGCCGA 
GCATTGTGGA TTAGCXTCAG GCTTAAAATC AGATGCATGT CTGGTAAGAT GACCACTCIC 
TCACTATCAA GAGCXTTQCAG AGCX:ATnTC CAGACCTCTG AOTOXCAGA ACACATAGTC 
CCCAOGTTTC TAATTTGGAG CAAASXZrTAAA AG 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2786 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234:
TTAGCAQGGT GAGCTGTTAA AACAGCACAC ATCTCTCATC CCCTCTTCCT TTATTCCCCC 
CrGGOTTTCA GAAAGGAAGG ATATATOQGG ACCACCTCCC CCTTCTTTGA TCCCAGCATC 
TCAGTCCCCC TCCCAACCCT CCATATGGCT CTCAATGGTG CTCAdTGCT TC3GAAGCAQG 
CTCCCAATAG GGAGGQGSCT GCCCTCTACA GTCTCTTTGA CTGTAAGACA GGGCTCTGTA 
TCAGTGAGAC GATGAGAAAA GTCCCAGGCT AATQGCAGAA ATTTGCACTT TGAACATGTG 
TGTTTTIOTG TTGTGGAACC TGAGATTCCT TATTTATTAA CAGGAAGTCT GATTTTTTTT 
TirrOGAGTC TTTCTTGCTA TATTTTGrrGG GGCTGGGAGA GAGAGATTAG ATTATTTTGA 
CATOGGATCC CITCCATAAC AGGTACTTTG AAGQCAAGAC ATAGGGnTGA AGAAGCACAA 
CCAGCCTCTG AAATCATAGC TCTCCAGTGG CTTTTAAAGA AAGCTGGrrCC TCAGCACTAA 
CAAAATCACT ACAATAGCCT AGTGCTTTTT TGGAAGCCTT TTTAGGGAAG AATOTTAGGT
TCATGGTAAC TAGTATGCTC TTTGftGATTT TTACAGTOTT GAAACTTAAG AATTTTGAGA 660
GGGTGAGGAG GGnTGTTCAG AATCTAAATT ACAGATAGAT GATTGTTTCT TGTGAATTTG 720
TTTciTTrcc riTm - m c tccctaccat TTCCTTACAT TTCCCTTOGG GCCCATCTCT 780
GGCTOCITGC ' ITlTlUmt: TTGCTTTOCT TTATCAGTTC ATTCCAGCTC CCTGrTTAGTG 840
AAOGACACTG CTGTTAGTGA AOGAACAAAG TCTATGAGTC CTAAAAITIT AAGTCAAAGA 900
AAACTQCTCT GrTTTCCCrTT TAGTAACACT TCTGAAGAGG AAAAACTTCA ATAGCCAAAG 960
TTAATAATCC TATATAATAA TTGCTTTOGC TTICACCTAA AATTCTGGGC ATCACAATTT 1020
CCTTGGGATA GAGGrTTGTGT TGGGGAATAG ATTGCTTATT GCTGTTCACT GGAGAGAAAA 1080
GGTAGTGTTT TTGTACAAGG TCATACCX3CC AGAAGCXCCA AATCCTATTT TQQCTCATCT 1140
TCAGCJEAAAG AGTAATTCCT ATCCTGTGTG CCTCAGAAGC TAGAATOGAA GGCTTAOCCT 1200
ATTCAITOrr TATTGTCAGA AATGCATGAT GGCTCTTOGA AAGAATGACG TTITOCTOGA 1260
AAAAAAAAAA AGAACAGTTT GTGrTTCACA AACATOGCTT ATCAATTTTT TCAAAGAATT 1320
CTTTTTTCCC AAAAAGAGGA GTAACAAAAT GTCATTTCTG AAAGAOGCTT ACTTTATAOC 1380
AACTAGTCTC AGCATTTOGG ATGCCAQGGA ACAGAGAGTG AGACACCTAC AATCACCAGT 1440
CTCAAATQCG CTATTGTTTC TTTTCAGAGT GriGCAGATT TGCCATTTCT CCATAATATG 1500
OOGATAGAAA ATGGAATAAA GATAGAAGOG ATGTAGAATA TGCITTCCTG CCAACATQGT 1560
TTCGAGTCGA CTTTQGTATA ITGACTAGAT TTGAAAATAC AAGATTGATT AGATGAATCT 1620
ACAAAAAAGT TGTCCTOCTC TCAGGTCXTT TTTACACTTT TTGACTAACT AGCATCTATA 1680
TTCCACACTT AGCTTTTITC TCACACTTAT CXnTTGTCTC CX3TAAATTTC ATTTGCAGTG 1740
GTTAGrrCATC AGATATTTTA GCCACCTACA CAAAAGCAAA CTGCATITTT AAAAATCTTT 1800
CTGAGATGGG AGAAAATGTA TTCTCCrTTC CTATACXX3CT CTCCCAACAA AAAAACAACT 1860
AGTTAGrrrCT ACTAATTAGA AACTTGCTGT ACTTTTTCTT TTCTTTEAGG GGTCAAGGAC 1920
CrrCTTTATA GCTACCATTT GOCTACAATA AATTATTGCA GCAGTTTGCA ATACTAAAAT 1980
ATTTTTTATA GACTTTATAT rmunTlT GATAAAGGQA TOCTQCATAG TAGACTTTGCJr 2040
GTAATTAAAC TA3XTCAGCC GTTrcCCTGC TTTCCXTTTCT GCTCCATATG CXirCATICTC 2100
CTTCCAGGGA GCTCTTTTAA TCTTAAAGTT CTACATTTCA TGCTCTTAGT CAAATTCTGT 2160
TACCTTTTTA ATAACTCTTC CCACTGCATA TTTCCATCTT GAATTGGTOG TTCTAAATTC 2220
TGAAACTGTA GTTGAGATAC AGCTATTTAA TATTTCTGGG AGATGTOCAT CXXTTCTTCTT 2280
KGTQGTrTGCC CAAQGTTGTT TTGOGTAACT GAGACTCCTT GATATCCTTC AGAGAATTTA 2340
GQCAAACACT GGCCATGGCC GTTGGGACTrAC TGOGAGTAAA ATAAAAATAT CXiAGGTATAG 2400
ACTAGCATCC ACATAGAGCA CTTGAAOCTC CTTTGTACCT GTTTGC3QGAA AAAGTATAAT 2460

GAGTOTACTA CCAATCTAAC TAAGATTATT ATAGTCTQGT TGnTCAAAT ACCArmTT 2520 

TCTCCTTTTG TGTmTCCC ACTTTCCAAT GTACTCAAGA AAATTGAACA AATGTAATOG 2580 

ATCAAOTTAA AATATTTTAT TTCTTAAAAG CCTmTiU C CTGTTGTAAT GTGCAGGACC 2640 

CTTCTCCTTT CATGGGAGAG ACAGGTAOrT AOCTGAATAT AGGTTQAAAA GC3TTATGTAA 2700 

AAAGAAATTA TAATAAAAQG GATACTTTGC TITTCAAATC TTOCTTTTCT CTTATTCTAG 2760 

GTAAGGCATA TTAAAAATAA ATATGT 2786 

(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

GQGTGCAGGA ATTCGGCACG AGAGAATGOT TGATTTTCTT TCCTATTTTA AQGATCTTCT 60 

CTCTTGTTGA TGTTGAAAAC TTACCTTAGT GAAGATGTGT TTCAACATGC TGTTGTCCTT 120 

TACCTGCATA ATCACAGCTA TGCATCTATT CAAAGTGATG ATCTGrTQGGA TAGTTTTAAT 180 

GAGGTCACAA ACCAAACACT AGATGTAAAG AGAATGATGA AAACCTGGAC CCTQCAGAAA 240 

GGATTTCCTT TAGrTGACTGT TCAAAAGAAA GGAAAGGAAC TTTTTATACA ACAAGAGAGA 300 

TTCTTTTTAA ATATGAAGCC TGAAATTCAG CCTTCAGATA CAAGGTACAT GCCCTCTTTC 360 

TTTTCATQCC ATCTCTTTTG CACTCTCAGG TGGAAATATT TTTAAGTGTT OTATAATCAT 420 

AAGTTCTTGT GAAACCTAAC AAGATTATCC CTTCCTAA 458 



(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 591 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:
AGGATGAAGA GGAAATTATC TCTTGGATTG CTCTCCAGGA AATCCTTCTC TATACTTTAA 60 



AAGCTCTPGT TCTTTPCTAG GARTCCAATG TGCTGATTGC TGCTAACAGT CAGQGTACAA 120

TTAAGGTOCT AGAATTOGTA TGAAGGGrTTA ACTCAAGTCA AATTGTACTT GATOCTGCTG 180

AAATACATCT GCAGCTGACA ATGAGAGARG AAACAGAAAA TGTCATGTGA TGTCTCTCCC 240

CAAAGrrCATC ATGGGTTTTG GATTTGTTTT GAATAmrr ' I C l T mTlC TTKTCCCTCC 300

TTTATGAGCC TTTGGGACAT TGGGAATACC CSWaCCAACTC TOCACCATCA A3CTAACTCC 360

ATGGACATTG CTC3CTCTTQG TOGTGTXATC TAATTTTTGT GATAGGGAAA CAAATTCTTT 420

TGAATAAAAA TAAATAACWA AACAATAAAA GTTTATTGAG CCACAGTTGA GCTTOGAAAG 480

TTTTICTCAA ATOC3«XAAG AGATAACTCT TTTTANGAAG TAGCATATGT GAACTATAAT 540

GTAACAGTGA ATAATTTC3TA AflGTTCGTAT TTCCCAACCT CTTTCQGAAT T 591



(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1286 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

30 TCTTTTTAAG GTACAGCAGG GAAGAACPGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 60 

TACAAAAGCT GAGTTTACTT ClCVltJCnX: TTTGTTCAAG ACTQGGCTTC CACCGAGCAG 120 

GAGATTACCT GGGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 180 


CAGGOGACGG GCAAATGAGA ACAGCAACAT ACAGGTOCTT TCTGAAAGAT CTGCTACTGA 240 

AGTAGACAAC AATTTTAOCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCOCACTCA 300 

40 CCTTOCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 360 

TCCAOCACOG GGTTTTCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 420 

AGAAAGTGGA CATTCCTCTG GTTATGATAG TSGTTCTGCA OGrPGCATTTC CATATGGCAA 480 


TGCGATGAAG AACGATACAG ATACAGGGAA TATGCAGAAA 6AGGTTATGA QCGTCACAGA 540 

GCAAGrrOGAG AAAANGAAGA AOGACATAGA GAAAGACGAC ACAGQGAGAA AGAGGAAACC 600 

50 AGACATAAGT CTTCTCGAAG TAATAGTAGA CGTrCGCCATG AAAGTGAAGA AGGAGATAGT 660 

CACAGGAGAC ACAAACACAA AAAATCTAAA AGAAGCAAAG AAGGAAAAGA AGCGGGCAGT 720 

GAGCCTGCCC CTGAACAGGA GftGCACOGAA GCTACACCTG CAGAATAGGC ATGGTTTTGG 780 


CCmTGTGT ATATTAGTAC CASAAGTAGA TACTATAAAT CTTGTTATTT TTCTGGATAA 840 

TG?ITTAAGAA ATTTACCTTA AATCTTGTTC TGTTTGTTAG TATGAAAAGT TAACTTTTTT 900 

TCCAAAATAA AAGAGTGAAT TTTTCATGTT AAGTTAAAAA TCTTTGTCTT GTACTATTTC 960

AAAAATAAAA AGACAGCAAT GACTTTATAT CXZAAGAAAGG AATGTGAATC ACyrcACTTAA 1020

CAGGGAATCT AAAGAGCTGT GTTAGCTGTG TACATACACA GATTATCTCA GAAAAGGTCA 1080 

AGQGTPCCAC TTOGC3CCACA GTTTmTGT TAATCAAACA CXIACTCTCTr AAQ^GGCTOC 1140 

ATCACAAARG GCAACCAAHG GGCOCCTCTT ARGGCTTTGA QGATTAAAAC TAfflCTTTAT 1200 

10 CCATTACTOC TGTGGACACT CTTQGCTTRG TATWrTTAGG GQGC2ITCCrr ACCmTnT 1260 

GGmrCCNC ACCTTTTTGG TTQGGC 1286 

(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 734 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

ATC3GCAGCGC AGAAGGACCA GCAGAAAGAT GCCGAGGCGG AAGGGCTGAG CGGCACGACC 60

CTGCTGOCGA AGCTGATTCC CTCOGC3TC3CA GGCCGGGAC3T QGCTGGAGCG GCGCCGCGCG 120

ACCATCOGGC CCTGGAGCAC CTTCGTQGAC CAGCAGCGCT TCTCACGGCC CCGCAACCTC 180

GGAGAGCTC3T GCCAGCGCCT CGTACGCAAC CTrGGAGTACT ACCAGAGCAA CTATOICTIC 240

GTGTTCCTGG GCCTCATCCT GTACTGTGTG GTGACGTCCC CTATGTTGCT QGTOGCTCTC 300

CXTOPCTTTT TCGGCGCCTG TTAACATTCT CTATCTGOGC ACCTTGGAGT CCAAGCTTOT 360

CCTCTTTQGC CGAAAGGTGA GCCCAGCGCA TCATATGCTC TGGCTGGAOG CATCTCCTTC 420

CCCTTCTTCT GQCTGGCTQG TGCGGGCTCG GCCGTCTTCT GQGTGCTOGG AQCCACCCIG 480

GTGGTCATCG GCPCCCACGC TGCCPTCCAC CAGATTGAGG CTGTCGACGG GGAGGAGCTC 540

CAGATGGAAC CCGTGTGAGG TGTCTTCTGG GACCTGCCGG OCTCCCGGGC CAGCTGCCCC 600

ACCCCTGCCC ATGCCTGrrCC TQCACGGTCT GCTGCTCGGG CCCACAGCGC OGTCCCATCA 660

CAAGOCCGGG GAGGGATCCC GCCTTTGAAA ATAAAGCTGT TATGGGTGTC ATTCAAAAAA 720

AAAAAAAAAA AAAA 734

(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 809 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:

CGGGGTICTrC AQGGTACCGG GCTO5ITACA GCAGCTCTAC OCCTCACGAC GCARACATGG 60 

C^tfXXTiWGAA'GGACCAGCAG AAAGA3X3CCG AGGCGGAAGG GCTQAGCX3GC ACGftCCCTGC 120 

10 TGCCGAAGCT GATTCCCTCC GGTGCAGCXX: GGGAGTOGCT GGAGCGGCGC OGCGCGACCA 180 

TCCGGCCCTG GAGCACCTTC GTOGACCAGC AGCQCTTCTC ACGGCCCCXX: AACCTGGGAG 240 

AGCTGrrOCCA GOOCCTCGTA CGCAACGrTQG AGTACTACCA GAC3CAACTAT G'AXJritXTltJl * 300 

TCCTGQGCCT CATCCTOTAC TGTGTGGTGA CGTOCCCTAT GTTQCTGGTG GCTCTCGCTG 360 

TCrrrTTCGG cgccicttac atpctctatc TCCGCACCTT GGAGTCCAAG CTTGTOCTCT 420 

20 TTGGCCGAGA GGTCAG(3CCA GCXXATCAGT ATGCTCTQGC TQGAGGCATC TCCTTCCCCT 480 

TCTTCTOGCT GGCTC3GTCCX5 GGCTCGGCOG TCTTCTGQGT GCTGGGAGCC ACCCTGGTQG 540 

TCATOGGCTC CXACGCTOCC TTCCACCAGA TTGAGGCTGT QGACGGGGAG GAGCTGCAGA 600 


TGGAACCCGT GTGAGGTGTC TTCTGGGACC TGCCGGCCTC CCGGGCCAGC TGCCCCACCC 660 

CTGCXXATQC CTGTCCTOCA CQGCTCTGCT GCTCGGGCCC ACTtfSCGCXXST CCCATCACAA 720 

30 GCCCX3GGGAG GGATCCXX3CC TTTGAAAATA AAGCTGTTAT GGGTOTCATT CAGGAAAAAA 780 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 809 


(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2201 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:

TOGACCCACG CGTOOGGCAA CATGGCGGCT GCOGTGGTGC AGCGCCCGGG CTGAGCGACA 60 

GCAAGTGCAG CGQGCTOCTA CCCCGGGTGA GQGGTQQCCT CCGCGTGGGA TCGTGCCCTC 120


TTCAGCCCGC TCCTGTCCCC GftCATCACCT CTATTCCGCA CGTCCCCTCC GCGCTGTGrTG 180 

TCTACTGAGA CGGGGAGGCG TGACAGGGCC CGGGTCCCTT CTCAGTGGTG CTCTGTC3CTT 240 

55 CAGQGCAAGC TCCCCGTCTC CGGGCGCACT TCCCTCGCCT GTGTrCGGTC CATCCTCCTT 300 

TCTCCAGCCT CCTCCCCTCG CAGGCGGATG AMCOGGACGA CGGGCCAGTG CCTGGCACCC 360 

CGGGGTTGCC ARGGTCCAMG GGGAACCCGA AGTOCGAGGA GCCCGARGTC CCGAACCAGG 420 

ARGGQCTOCA GOGCATCAMC GGCCTGTCTC CCGGCCGTTC GGCTCTCATA GK3GCGG1X3C
TGTOCTACAT CAATCTOCTG AACTACATOG ACCQCTTCAC CGTOOCTOGC GTCCTTCCCG 
ACATOGfiGCA GTTCTrCAAC ATOGGQGACA GTAGCTCTGG GCTCATCCAG ACCGnCTTCA 
TCTCCAGTTA CATGC3TCTTG GCACCTGTOT TTGQCTACCT OGtSTGACAGG TACAATOGGA 
AGTATCTCAT GTGCGGQGGC ATTGCCTTCT GGTCCCTGGT GACACTOGGG TCATCCTTCA 
TCCCC3GGAGA GCATTTCTOG CTGCTCCTCC TGACCCGC3GG CCTQGTQGGG GTOQQGGAGG 
CCAC3TTATTC CACCATCGCG CCCACTCTCA TPGCCGACCT CTTPGTOGCC GACCftGCGGA 
COGGATGCTC AGCATCTTCT ACTTTGCCAT TXXOGTGGGC ACTQGTCTQG GCTACAOTCC 
AGGCTCCAAA GTCAAGGATA TQGCTGGAGA GTGGCACTGG QCTCTGAGGG TCACACOGQG 
TCTAGGAGTG GTQGCCGTTC TGCTCCTGrTT CCTGGTACTG OSGGAGCCGC CAAGGGGAGC 
CX3TGGAGCGC CACTCAGATT TOCX^WXCCT GAACCCCACC TCGTQGTQGG CAGATCIGAG 
GGCTCTQC3CA AGAAATOCTA GTTrcGTCCT GTCTTCCCTG GC3CTTCACTC CTGTOGCCTT 
TGrrCACQGGC TCCCTGGCTC 1CT0GGCTCC GGCATTCCTG CTGCX3TTCCC GCaiaTTCCT 
TGC3GGAGACC CCACCCTGCC TTCCCGGAGA CTCCTGCTCT TCCTCTCACA GTCTCATCTT 
TGC5ACrCATC ACCTCCCTGA COGGAGTCCT GGGrTGTOGGC CTGGGTGTC3G AGATCAGCCG 
COGGCrcCGC CACTCCAACC CCCQC3GCTGA TCCCCT0C5TC TX5TGCX:ACTG GCCTCCTGGG 
CTCTGCACCr TTCCTCTTCC TGTCCCTPGC CTGCGCOCGT QGTAGCATCG TOGCCACTTA 
TATITIMATC TTCATTOGAG AGACXXTTCCT GTCCATGAAC TGGGCCATCG TOGCCGACAT 
TCTQCTGTAC GTGGTGATCC CTACCCGACG CTCCACCGCXr GAGGCX^TTCC AGATCGTGCT 
GTCCCAOCTG CTGQGTGATG CTQGGAGCCC CTAOCTCATT GGCCTGATCT CTCACCGCrT 
GCGCC3QGAAC TGGCCCCCCT CCTTCTTCTC CGAGTTOCGG GCTCTGCAGT TCTOGCTCAT 
GCTCTGOGCG TITGTTGGGG CACTGGGCGG CGCACTTTCC TGGGCACCGC CATCITCATr 
GAGGCCXSACC GCCX3GCQGGC ACAGCTOCAC GTQCAGGGCX: TCCTCCACGA AGCAGGCTCX: 
ACAGACGACC GGATTGTOGT GCCCCAQCGG GGCOGCTCCA CCCCSCGTCCC CGTOGCCACT 
GTGCTCArcT GAGARGCTOC CX3CTCACCTA CCTGCACATC TGCCACAGCT GGCCCTCGGC 
CCACXXXIAOG AAGGGCCTGG GCCTAACCXX: TTGGCCTGGC CCAGCTTCCA GAGGQACCCT 
G0C5CCGTGTG CXIAGCTCCCA GACACTACMT GQGTAGCTCA GQOGAGGAGG TCGOGCSTTCCA 
GGAGGGGGAT CXXTTCTCCAC AGGGGCAGCX: CCAAGGGCTC GGTCCTATTT CTTAACGGAAT 
AAAATITOTA GCCAGACCCC AGGTOCCTGC TCrTOGTCTTT CTCTGGGTOG CCTCTGATCT 
TGCACCCCGT CTTCACCCCA GGGCTCCTGA AGACTGTOCX; T 2201






(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1661 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:

GTCCTTCOOG ACATOGAGCA GTTCTTCAAC ATCGQGGACA GTAGCTCTGG C3CTCATCCAG 60 

ACCGTC3TTCA TCTCCAGTTA CATGGTGTTG GCACCTGTC3T TTQGCTACCT GGCTreACAGG 120 

TACAATCC3GA AGTATCTCAT GTGCGSGGGX3C ATTGCCTrCT GGTCCCTGGT GACACTOGQG 180 

20 TCATSCrrCA TCCCCGGAGA GCATTTCTCX; CTGCTCCTCC TGACCCGGQG CCTOC?rGGGG 240 

GTCC3GGGAGG CCACTTATTC CACCAT0C3CG CCCACTCTCA TTGCOGACCT CnTCTGOCC 300 

GACCAGOQGA SOQGATGCTC AGCATCITCT ACTTTGCCAT TCa3GTGGGC<?jOTraC?rcTOG 360 

GCTACATTGC AGGCTCCAAA GTPGAAGGATA TGGCTQGAGA CTC3GCACTQG GCTCTCAGGG 420 

IGACACCQQG TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT CCTGGTAGTC CGGGAGCCGC 480 

30 CAAGGGGAGC CGTTGGAGCGC CACTCAGATT TGCCACCCCT GAAOCCCACC TCGTCGIGGG 540 

CAGATYTGAG GGCTCTQGCA AGAAATOCTA GTTTCGrrCCT GTCTTCCCTC GGCTTCACTC 600 

CTGTGGCCTT TGTCACGGGC TCCCTGGCTC TGTC3QGCTCC GGCATTCCTC CTCCGTTCCC 660 

GCGTGGTCCT TGGGGAGACC CCACCCTGCC TTCCCGGAGA CTCCTQCTCT TCCTCTCACA 720 

GTCTCATCTT TGGACTCATC ACCTGCCTGA CCGGAGTCCT OQGTCTGGGC CrGGCSTOTOG 780 

40 AGATCAGCCG COQGVTCCGC CACTCCAACC CCOGGGCTGA TCCCCTGGTC TCTOCCACTC 840 

GCCTCCTGGG CTCTQCACCC TTCCTCTTCC TGTCCCTTGC CTGOGCCCGT GGTAGCA3a5 900 

TGGCCACTTA TATTTTCATC TTCATTQGAG AGAOCCTCCT GTCCATGAAC TOGGCCATCG 960 

TOGCCGACAT TCTGCTGTAC GTQGrPGATCC CTACCCGACG CTOCACCGCC GAGGCCTTCC 1020 

AGATOGTCCT GTCCCACCTG CTGGGTGATG CTGQGAGCCC CTACCTCATr GGCCTOATCT 1080 

50 CTGACCGCCT GCGCOGGAAC TQGCCCCCCT CCTTCTTGTC CGAGTTCCGG GCTCTCCACJr 1140 

TCTCGCTCAT GCTCTGCGCG TTTGnTGGGG CACIGGGCGG CGCACTTTCC TX3GGCACCGN 1200 

CATCTTCATT GAGGCCGACC GCCGGCGGGC ACAGCTGCAC GTGCAGGGCC TCCTCCACGA 1260 

AGCAGQffTCC ACAGAOGACC QGATTGrTGGT GCCCCAGCGG GGCOGCTCCA CCCGCGK3CC 1320 

CGTQGCCAGT GTQCTCATCT GAGAGOCTGC OOCTCACCTA CCTGCACATC TCCCACAGCT 1380 

KGCCCTGQGC CCACCCCACG AAGGGCCTGG GCCTAACCCC TTGGCCTCGC CCAGCTTCCA 1440

GAGGGACOCT GGGCrGTCTG CCAGCTCCCA GACACTACMT QGGTAGCICA GGQGftGGAGG 1500

TQGC3GGTCCA GGAQQGQGAT CCCTCTCCAC AGGGGNCACC CXAAGGGCTC GCTTOCTAITr 1560

GTAACGGAAT AAAATTTOTA GCCAGftCCCC AGGTOCCTGC TCTOJlV m ' CTCTOGGIGG 1620

CCTCTGA3CT TGCACCCXX3T CTTCACCCCA GGGCTCCIGA A 1661
(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1146 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

NGACAGAAAA GCAGAAGATG AGACTCTGTT CATTCACTTT TCCTAGGCCC ATCCTOK5CT 60 

CATCTTTCCC CCTOCCATCA TACCTCCTCC TTCCTGGAGC CTCTGCCGGC TTOGCTCrAA 120 

TQGTOGCACT TACCTGGATA TTrCAGTGGG AGGATGAAAG GCGAGACTCA CCCTACGCQG 180 

TQC3GACAGAT GGGGftGAGGA AAAAOGCAGA GATW3CCAGG AGAGGGGTCC AGGftCAAACC 240 

AGAGAGGrrTG GC?rCAGGQGA AAAGTOINGG GAGAAAGTGG GGTOCAGGCC CTCCAGGCCG 300 

GTTEAGCXTtf; CAGCPGOGGC CTCCCCGQGC CCITGGCATC CAACTTCGCA GACAGGGTAC 360 

CAGCCTCCTG GTGTGTATCA TAGGATTPGT TCACATAGTG TTATGCATGA TCTTCGTAAG 420 

GTTAAGAAGC CGTGGTGGTG CACCATGACA TCCAACCCXTT ATATATAAAG ATAAATATAT 480 

ATATATATGT ATGTAAATTA TAGCACTGAG GGCCCTGCTG CXXTOCTGGA CCAAGCAAAA 540 

CTAAGCCTTT TGGTTPGGGT A3TATGTTTC GTTTrGTrAT TKmrCTTT TFOTOQCTTTG 600 

TCTTATOTCG TGATAGCACA AGTroCCAGTC GGATTGCTCT GTATTACAGA ATAGTOTnT 660 

TAATTCATCA ATGTTCTAGT TAATGTCTAC CTCAGCACTT CCTCTTAGCC TAATTTTAGG 720 

AGGTTGCCCA ATmvmC TTCAATTrTA CTGGTTACTT TnTGTACAA ATCAATCTCT 780 

TTCTCTCTTT CTCTCCTCCC CACXTOTCAC CCTTGCCCTC TCCATCTCCX: TCTCCCGCCC 840 

TCCCCTCCTC CCTCTGGCTC CCCCTCTCAT TTCTGTCCAC TCCATTCTCT CTCCCTCTCT 900 

CCTGCCTCCT GCTOCCCCCT CCCCAGCCCA CTTSCCOSAG TTGTGCTrGC OGCTCCTTAT 960 

CTGTTCTACT TCCQAAGCAG TTrCACTCGA AGTTGrTGCAG TCCTGGTTGC AGCTTTCCGC 1020 

ATCnOXTTC GTTTCGTGTA GATTGACGCG TrTCTTTGTA ATITCAGnrcT TTCTCACAAG 1080 

ATTTAAAAAA AAAAAAAGGA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC 1140 
(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

AACCCACGGC TC5CTGCGC5CA GGGCC3T0GAG GGCAGAGGQC OGCQGAGGCG CACTTOCAAA 60 

CATQGCTCAG AGCflGAGACG GOOGAAACCC GTTCGCCGAG CCCAGCGAGC TTCACAACCC 120 

Cn-TCAGCCA CCACCAGCCT ATQAGCCTCC AGCCCCTGCC CCATTCCCTC CACCCTCAGC 180 

TCCCTCCTTG CMSOCCTOGA GAAAGCTCAG CCCCACAGAA CCTAAGAACT ATCGCTCATA 240 

CACCACTCAG GCCTCAGCTG CAGCAGCCAC AGCTGAGCTG CTTGAAGAAAC AGGAGGAGCT 300

CAACCGGAAG GCAGAGGAGT TGGACCGAAG GAGNCGAGAG CK5CAGCATC CTGCCCTOGG 360 

ROCXIACAGCT ACTOGACAGA ACAATTGGCC CCCTCTACCT TCTTTITCTC CACTTCAGCC 420 

CTQCTrrrrC CAGGACATCT CCATCXSAGAT CCCOCAAGAA TITCAGAAGA CTGTATCCAC 480 

CATGTACTAC CTCTGGATGT GCAGCACGST GGCTCTTCTC CTCAACTTCC TCGCCTCCCT 540 

GGCCAGCTTC TCnCTGGAAA CCAACAATGG CGCAGGCTTT GGGCTTTTCTA TCCTCTCGCJr 600 

CCTCCTITTC ACTCCCTGCT CdTTGTCTG CTQGTACCGC CCCATGTATA AGGCTTTCCG 660 

GAGTGACAGT TCATTCAATT TCTTOGTTTT CTTCTTCATT TTCTTCGTCC AGGATC?K3CT 720 

CTTTC3TCCTC CAGGCCATTG GTATCCCAGG TrGGGQATTC AGTrGGCTOGA TCTCTGCTCT 780 

GGTGGTGCCG AAGGCAACAC AGCAGTATCC GTGCTCATGC TGCTQGTCGC CCTOCTCTTC 840 

ACTQGCATTG CTGOGCTAGG AATTGTCATG CTGAAACGGA TCCACTCCTT ATACCGCCGC 900 

ACAGGTGCCA GCTTTCAGAA GGCCCAGCAA GAATTTGCTG CTGC3TCTCTT CTCCAACCCT 960 

GOGGTGCGAA COGCARCTTG CCAATGCAGC CX3CTGGGGCT GCTGAAAATC CCTTCCGGGC 1020 

CCCGPGACCC CTGACTGGGA TQCCCTC3GCC CTGCTACTTG AGGGAGCTCA CITAGCTCCC 1080 

GTCCCTAAGG TCTCTGGGAC TTGGAGAGAC ATCACTAACT GATOGCTCCT CCGTACyiGCT 1140 

CCCAATCCTA TGGOCATGAC TGCTGAACCT GACAGC3CGTG TQGQGAGnTC ACTOtXSACCT 1200 

AGTCOCCCCA TCAGGCCACA CTGCTGCCAC CTCTCACACG CCCCAACCCA GCTTCCCTCT 1260 

GCTGTGCCAC GOCTGTTOCT TaXTTTATTT AAATAAAAAG AAAGTCGAAC TCGAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAG GQQQGNCCNC 1350
(2) INFORMATION FOR SEQ ID NO: 244:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

TCCCAGAGGC CGGGGQGTTC CAGCTCTC5CC TC5TAGCAGAG CCCTGAGGAG GAGGAGGAAG 60

AGGATCSTGCT GAAATACGTC OGGGAGATCT TTTTCAGCTA GGGCATAAAC TGTGCACTGA 120

ACTGTCTGCC GAGAGCAGCT GGAGGACAGC TGAGCTTCCA CTQGTGCPGC TQGGCCGMCC 180

GCCTGTQGGA ATQGGGCTCT CTOTCSCTCCT ACCTTTGTGC CTTCTTGGGC CTOGCAGATT 240

CACCTCAGGC CAGAAGCCCC TGGACACTCC GQG0CTTGC3G GTGCCGTTCT GAGTGTOCGG 300 

AAGGCAGGAC TCAAAATGAG ATCCCATTTG ACTCCCTCTG TATGTACTGT GCCCTCTCCT 360

GGCTCTTGAG GCTCTGGAGT CCCAATTGTC TGTGTTAGTC AGTGACCAGG TTCCAGQGAA 420 

AATRATGTCA TGTQGTGC3TC CAACTTACTG GAACCAAAGA GACAGTACTT TCCAAAGAAA 480 

AGGATCACTG CCAQGTGCAC TGGAATTGCT ACAGITTAGT OCGCATGATC TCTCCTGAAG 540 

GAGGAAGCCT GTTTCAAAAA TAGTTTCCAT CATGAGTCTA TCAATGAGCT CCCACCTCTC 600 

CAGCCAGCCT AGAAAGCAAA CGAGCTGCCC ACACTTCTCT GCCCTGTCTG GGAGGTTC5AG 660

GOCACACrrGT ATAGACTC3GT AAGCCAGACA GGCCTCCTCC OGCAAGCTC3C TACCTTQCTT 720 

TCACCTGTAC CTTOGTCCCC GGGCAGCTAG CTATAAAGCA AGAQGGACAG GAGCCCAGAA 780 

GAGACACTGA GGACAAGAGA TCACACCAGA CTACATGrrCT CrGOCTCTGT TTTCAGTGTG 840 

GCTTTGGACA GGAATATATG AATAAATCAC TC3CCATACAG CnTITCCAAT ACACAAGTGC 900 

TAGAAAATAC ACACAATTCC CCAATGCCTA ACTITCTGCTA ATGTCTTTCC AAGTTCTGGG 960

TTQGGAAGTG GAGGGTOGCA GCGTTTGTTT GTGOQCAACC GTCCAGTCCT GrTTCACAGOG 1020 

GTCCTCCAGG GTCTCATCAT GGGAGTGATT TGTCAGCGGA CGCCTCTGCC 1080

CTGTCTOGCT TCAGGTOCAG GGAAGCTTTG AAGCAGTCAA GCCTTGTCrr TC3TACCCCAT 1140 

GTGTCCTC3TC TTICTTGAGT CACTCAGAGA TCACTCCTQG ACCTCTQQGG TTOGAGTTCC 1200 

AGTGATGGCT TATGGCGGCC CACTCACTAT GC?rGQGCTGA CSTGGAAGCTC CTTAACCATG 1260

TCCCCAGAGA CACTGAGGTG CTCGCTCTTT TAATGTCCTC CTTTGTTGCC GTAAGrPTCTT 1320 

TGCTAGGTTT CATTTTOGCA TTTQGCAAAT CAGCCTGGAA CTTCTGGCCCC ATGACAQCAA 1380 

TCACTCCCTC CCCACCXTCC TGAAGCTAGA GGAAGATTTG CTCAGATCCA TTAATTAAAG 1440

CAGGAATTGG TGTCACAATG AGCTGCATGG TTTAGGGACT CTTTOGGAQC CTTOGAACTC 1500

CTGAAOGACA AACAATCTTG TACTAAGAA 1529



(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

GTGCGAOGTC CXXX3CCAGCC CCCAGCGGCC 1TCCCXX3CCC QQQGCGCTCC CAGAGCAAAC 60

GAGGCCXXTG AGAGCTCCAC CTAGTTCACA GGATAAAATC CCACAGCAGA ACTCGGAGTC 120

AGCAATQGCT AAGCXXXZAGG TGGTICTAGC TCCTGTATTA AaCTCTAAGC IGTCTCTCAA 180

TGCCCCTGAA TTTTACCCTT CnGGTTATTC TTCCAGTTAC ACAGAATCCT ATGAGGA3t3G 240

TTGTGAGGAT TATCCTACTC TATCAGAATA TGTTCAGGAT TTTTTGAATC ATCTTACAGA 300

GCAGCCTGGC AGrTTTTGAAA CTGAAATTGA ACACnTTCCA GAGACCCTGA A3X3GTTCTGT 360

TACAACAGAT GATQCTTTGC AAGAACTTGT GGAACTCATC TATCAACAGG CXACATCTAT 420

CCCAAATTTC TCTTATATQG GAGCTCGCCT GTGTAATTAC CTGTCCCATC ATCTCACAAT 480

TAGCCCACAG AGrPGGCAACT TOCXXXMTT GCTACTTCAA AGATGTCGGA CTGAATATGA 540

AGTITAAAGAT CAAGCTGCAA AAGGGGATGA AGTTACTOGA AAACGATTTC ATGCATTTCT 600

ACTCTTTCTG GGAGAACTTT AOCTTAACCT GGAGATCAAG GGAACAAATG GACAGGTTAC 660

AAGAGCAGAT ATTCTTCAGG TOSGTCTTCG AGAATTGCTG AATQCCXTGT TTTCTAATCC 720 

TATGGATGAC AATTTAATTT GTGCAGTAAA ATTOTTAAAG TTGACAGGAT CAGTPrTQGA 780 

AGATGCTTQG AAGGAAAAAG GAAAGATGGA TATGGAAGAA ATTATTCAGA GAATTQAAAA 840 

CCmCTCCTA GATOCAAACT GCAGTAGAGA TGTAAAACAG ATGCTCITCA AGCTTGTAGA 900 

ACTCCGGTCA ACTAACTQGG GCAGAGTCCA TGCAACTTCA ACATATAGAG AAQCAACACC 960

AGAAAATGAT CCTAACTACT TTATGAATGA ACCAACATTT TATACATCTS ATCGrTGrTTCC 1020 

TTTCACTGCA GCTGATCCAG ATTACCAAGA GAAATACCAA GAATTACTTG AAAGAGAGGA 1080 

CirmrcCA GATTATCAAG AAAATGGAAC AGATTTATCC GQGGCTQGTC ATCCATACTT 1140 

GGATCStEAlT GATGATGAGA TGGACCCAGA GATAGAAGAA GCITATGAAA AGTTTICTTr 1200 

GGAATCAGAG CXJTAAGCGAA AACAGTAAAG TTAAATTTCA GCATATCAGrr TTTATAAAGC 1260

AGTTPAGGTA TGGTGATTTA GCAGAACACA AGAGAGCAAG AAAATGrTGTC ACAICTATAC 1320

CAAATTRAOG ATGTroAGnT ATGTTACTAA TCTATC3CAAC TTTAATTTTG TTTAACACTA 1380

TCPGCCAAAA TAAACTTTAT TOCCTATAAC TTAAAATGTC TATATATATA TAA'EAGTTTA 1440 

TTATGTACAG TTAATTCTAC TGTTTTGGCT QCAATAAAAT CGATTTTGAA ATAAAWRAAA 1500 

AAAAAAAAAA AAGGGNCXXT GCTCTAGAGG ANCCAAG 1537



(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:

TGCAGGATTT GGCCAGGACC CSCCGOGGTG G0GC3TTGCTA TCGCTTOGCA GAACCTACTC 60

AGGCAGCCAG CTGAGAAGAG TTGAGGGAAA GTCCTGCTGC TGGGTCTGCA GACGCGATOG 120 

ATAACGTGCA GCCGAAAATA AAACATCGCC CCTTCTGCTT CAI3TGTGAAA GGCCACGTCA 180 

AGATGCTGCG GCTQGATATT ATCAACTCAC TCX3TAACAAC AGTATTCATG CTCATCGTAT 240 

CTGTGITGGC ACTQATACCA GAAflCpwCAA CATTGACACT TQGTOGAGGG GTGTTTGCAC 300 

rrCTGACAGC AGTATCSCTGT CTTGCOC3ACG GGGCCCTTAT TTACCGGAAG CTTCTOTTCA 360

ATCCCAGOGG TCCTTACCAG AAAAAGCCTG TGCATGAAAA AAAAGAAGTT TTGTAATTTT 420 

ATATTACTTT TTAGTTTGAT ACTAAGTATT AAACATATTT CTGKATTATT CCAAAAAAAA 480

AAAAAAAAAA AAAAAAAATT TGGTQG 506 



(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1348 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:

GTCTTTCTTT IWCTGmTG AGTTCGTCAG TGAGTCAATA GGGTAACATG GGCdTCAGG 60

ATGACCCCTT GGAACTGTGC CGAfJTTOCTT AAATCTCAGC TGGGATCCTG GACCTC3GGAG 120

GCCCCTGTGA GGGOCAGCTC TOGAAAAACC TGGGAGTTGA TGCCGGAGGY TGGGAAGAAC 180

TCTGCTOGAG GGCAGGGTGC CCTOGAACAC TQGTftGnTCT OQGGCTGGGA GGGAGAGGGG 240

CrCXXXXTTT CTCTGAAATG AfiOWCTOCTC TTCAGCAGTT CAAGTACTTG TTCTCAAAAC 300

ATTTTCTAAT TGATTQC3TAG CTTTTCATAA GCAlTCriTC TTTAflGGCAT GGAAAGGGAA 360 

GAATGCTCAA GCAAGTCATG TTrerTTTTCA GTOOGATGGG OCCGCGOTCT CACTCCTOQG 420 

GGCnCOCCT TGCATGTOGC AOCnTCTGC AGGGCCACCA GGCAGACTCT TOXACCnC 480

•rccxAcravA GCACCAAGQG GCTTGAACCG TAATTTGGCT AATCAGAGGC AmrmTC 540

'^^^^^'^ TITCACACTT GTCCAACCGT CXTATTTTTT TAAAAGTTCT Gi ' iXJL ^ nxSVA 600

n-AACAOGAA ACTAGAGAGA AATAGTTTCT GAAQCCAGnT TATTCTCAAG ATCCCCAAQG 660

GGAGGnrCGG TAGfiGAAAAA TAGTAAGCTG CTTTAGAAAC TGACGAGGGC AAACAGCCAG 720 

GACGCAOTOG AGAGGAATTT GCCAAAGATC TACCCTGAGA TAACGCCTCT CXrAGTOTCrr 780 

CACCACGTGA ATAACCAGCG CTCCAAAGTG TTriTCTGCT TTGAAAAAAA AAATTCCACA 840 

AGCmTTAAA GGTGCATTTA AGAATCCATG TGACnTAGA ATGGAACTCC CGC3CCCTOGC 900 

AACTCTCACG TGTGCTAGAA GGTTCGATGC CTCTQGAATG CATOIGATAC TCATCTCCAT 960 

TTK?rTTCCT TGATTOCATT T TlXJin Vr rr TAGCAGATCT GrrCCCTCTCG GTCGICTCTA 1020 

AGAAGTOGGA CACCTTOGTT TTrGTGTTAG ATTGAGCTQG GCAGCTCCAA TCAGCrrCTT 1080 

TATATGCAAA TTAQGCACGA CCCATCTGTG GTT0CXrK3GT TGGTGGCTAA TCAAGTX3AGG 1140 

GGAGQGAGGG ATGTCACXXX: AAAAGTAGGC CCTCCCATTG GCTTIGGCCA GGCCAGACAC 1200 

TTCACATCX3T TTACATGGTT CTC3TGTAATT TTAAAGTrTA TGTGTATAAA GCGAAGCTGT 1260 

TTCTGTCAAA CTGTATATTT TOPAAATAAA TATATTQCTA CTTTGAGAWR AAAAAAAAAA 1320 

AAAAACTCGA GGGGGGOCCG GTACCCAA 1348 



(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1766 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

GTGCCGAATC GGCAGAGCGG CACGAGOGGC CACGAGAGCA GGCGQAGTAA AGGGACTTCA 60

GCGAOCCAGT TGCOGGATTA TTCTATTTCC CCTCCCTCTC TCCCGCCCCG TATCTCTITr 120

CftCCCPTCTC CCACCCICGC TOGOGTASCA TGGCGGAGCG TCGGCGGCCA CTCAGICCCA 180

TTCCATCrCC TOGTOGTOCT TOGGAGCCGA GCCGTCOSCG CCCGC3CGQCG GCGGGAGCCC 240

AGGAGCCTGC CCOGCCXTOG GGACGAAGAG CTGCftGCrcC TCCTGTGCGG TGCACXSATCT 300 

GATTTTCTGG AGAGATGTGA AGAAGftCTGG GTi " itJli:nT GGCACCACGC TGATCATGCT 360 

GCTTTCCXTTG GCAGCTTTCA GTCTCATCAG TO i UJlTiVl' TACCTCATCC lOXriVriVi' 420 

CTCTGTCACC ATCAGCTTCA GGATCTACAA GTCCGTCATC CAAGCTGTAC AGAAGTCAGA 480 

AGAAGGCCAT CCATTCAAAG (XTACCTGGA CGTAGACATT ACTCTGrrCCT CAGAAGCTTT 540 

CCATAATTAC ATGAATQCTG CCATGGTGCA CATCAACAGG GCCCTGAAAC TCATTATTOG 600 

TCTCTTTCTG GTAGAAGATC TOGTTGACTC CTTCAAGCTG GCTGTCTTCA TGTGGCTGAT 660 

GACCTATGTT GGTCCTGmT TTAACGGAAT CACCCTTCTA ATTCTTGCTG AACTGCTCAT 720 

TTTCAGTOPC CaavrTGTCT ATGAGAAGTTA CAAGACOCAG ATTGATCACT ATGTTGGCAT 780 

CGCXXXSAGAT CAGACCAAGT CAATTGTTGA AAAGATCCAA GCAAAACTCX: CPOGAATOSC 840 

CAAAAAAAAG GCAGAATAAG TACATGGAAA CXZAGAAATGC AACAGTTACT AAAACACCAT 900 

TTAATAGTTA TAACGTCGriT ACTTGTACTA TGAAGGAAAA TACTCAGTCT CAGCTTGAGC 960 

CTGCATTCCA AGCTTTTTTT TTAATTTGGT GTTTTCTCCC ATCCTTTCCC TTTAACCCTC 1020 

AGTATCAAGC ACAAAAATTG ATGGACTGAT AAAAGAACTA TCTTAGAACT CAGAAGAAGA 1080 

AAGAATCAAA TTCATAGGAT AAGTCAATAC CTTAATGGTC GTAGAGCXnT TACCTGTAGC 1140 

TTGAAAGGGG AAAGATTOGA GGTAAGAGAG AAAATQAAAG AACACTTCTO QGTCCTTCTG 1200 

TCCAGTTTTC AGCACTAGTC TTACTCAGCT ATCCATTATA GTTTTGCCCT TAAGAACTCA 1260 

TGATEAACTT ATGAAAAAAT TATTTGGGGA CAGGAGTCTG ATACCTTCCT TGGTTTTTTT 1320 

TTOCAGCXCT CAAATCCTAT CTTOCTGCCX: CACAATGTGA GCAGCTACCC CTGATACTCC 1380 

TTTTCTTTAA TGATTTAACT ATCAACTTGA TAAATAACTT ATAOGTGATA GTGATAATTC 1440 

CTGA3TCCAA GAATGCCATC TGATAAAAAA GAATAGAAAT GGAAAGTGGG ACTCAGAGGG 1500 

AGTCAGCTWGG CATGCTGCGG TGGOCSGTCAC TCXCTCTGCC ACTATOXCA GGGAAGGAAA 1560 

RGCTCCGCCA TTTGGGAAAG TGGTTTCTAC GTCACTGGAC ACCGGTTCTG AGCATTAGTT 1620 

TGAGAACTCG TTCCCGAATG TGCTTTCCTC CCTCTCCCCT GCXXIACCTCA AGTTTAATAA 1680 

ATAAGGnTGT ACTTTTCTTA CTATAAAATA AAAAAAAAAA AACTCGAGQG GGGCCCX3GTA 1740 

CCCAAATCGC CGGATATGAT CGTAAA 1766 
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2664 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

ACyrGTCCTCG GAGCAGGOSG AGTAAAi3GGA CTTGAGCGAG CCAGTTGCCG GATTATTCTA 60 

TTTCCCCTCC CTCTCTCCCG COCXCTATCT CrTTTCACCC TTCTCCXrACC CTCGCTOQCG 120 

TASCATGGCG GAG0C3TOGGC GGCCACTCAG TCCCATTCCA TCTCCTCGTC GTCCPTOGGA 180 

GCCGflGOCGT CX3GCX3COCGG COSCGGOGGG AGCCCAGGAG CCTQCCCCX3C OCTGGGGACG 240 

AAGAQCTOCA GCTCCTCCTG TGCXSGTGCAC GATCTGAaTT TCTQGAGAGA TGTGAAGAAG 300 

ACTGQGTTTG TCTTTQGCAC CACGCTGATC ATGCTGCTTT CCCTQGCAGC TTTCAGTOTC 360 

ATCAGTCTGG TITCTrACCT CATCCTOGCT CTTCTCTCTG TCACCATCAG CTTCAGGATC 420 

TACAAGTCCG TCATCCAAGC TGTACAGAAG TCAGAAGAAG GCX:ATCXA'IT CAAAGCCTAC 480 

CTOGACGTAG ACATTACTCT GTCCTCAGAA GCTTTCCATA ATTACATGAA TGCTGCCATG 540 

GTCXACATCA ACAGGGCXCT GAAACTCATT ATTaSTCTCT TTCTQGTAGA AGATCTGGTT 600 

GACTCCTTGA AGCTOGCTGT CITCATGrrGG CTGATGACCT ATGTTGGTGC TGTTTTTAAC 660 

GGAATCACXX: TTCTAA3TCT TQCTGAACTG CTCATTTTCA GTGTCCCGAT TGTCTATGAG 720 

AAGTACAAGA CXX:AGATTGA TCACTATOTT GGCATCGCCC GAGATCAC3AC CAAGTCAATT 780 

GTTGAAAAGA TCCAAGCAAA ACTCCCTQGA ATCGCXAAAA AAAAGGCAGA ATAAGTACAT 840

GGAAACCAGA AATGCAACAG TTACTAAAAC ACCATTTAAT AGTTATAACG TCGTTAdTG 900 

TACTATGAAG GAAAATACTC AGTGTCAGCT TGAGCCTGCA TTCCAAGCTT TTTTTTTAAT 960 

TTQGTGnrr CTCCXIATCCT TTCCCTTTAA CCCTCAGTAT CAAGCACAAA AATTGATQGA 1020 

CTGATAAAAG AACTATCTTA GAACTCAGAA GAAGAAAGAA TCAAATTCAT AGGATAAGriC 1080 

AATAOCTTAA TQGTQGTAGA GCXTTTTACCT GTAQCTTGAA AGGGGAAAGA TTQGAGGTAA 1140 

GAGAGAAAAT GAAAGAACAC CTCTGGGTCC TTCTGTCCAG TTTTCAGCAC TAGTCTTACT 1200 

CAGCCATCCA TEATAGTTTT GCCCTTAAGA AGTCATGATr AACTTATGAA AAAATTATTT 1260 

GGGGACAGGA GTGTGATACC TTCCTTQGTr TTTITTTGCA GCCCTCAAAT CCTATCTPCC 1320 

TQCXrCACAA TGTGAGCAGC TACCXX7PGAT ACTCCTTTTC TTTAATGATT TAACTATCAA 1380 

CTTGATAAAT AACTTATAGG TGATAGTGAT AATTCCTGAT TCCAAGAATG CCATCTGATA 1440 

AAAAAGAATA GAAATGGAAA GTGGGACTGA GAGGGAGTCA GCAGGCATGC TGCXSGTOGCG 1500 

GTCACTOCCT CTGCCACTAT COXAGGGAA GGAAARGCTC CGCCATTTQG GAAAGTGGTT 1560 

TCTAOGTCAC TQGACACXX3G TTCTGAGCAT TAGnTTGAGA ACTOHTCCC GAATGTGCTT 1620 
•rccTCccTCT crccTOcccA ccTCAAarrr AATAAATAAG GrrrGrrAcrrr TCTTACTATA 1680

AAATAAA3OT dCTAACTGC TGrTGCACTGC TGTAAACTTG TTAGAGAAAA AAATAACCTC 1740

CATGTOGGCT CCTCAGTTAT TGAGTriTTG TGATCCTATC TCAC?rcroGG GGGGAACAaT 1800

CTCAAGAGC5T GAAATACAGA AACSCCnTTT TTCTPGATCT TTTCCCGftGA TTCAAATCTC 1860 

OGATTCOCAT TTGGQGGCAA CnTITmCT TCACCTTCAA TATCAGAATT CAGOGAACTT 1920

GAAAGAAAAA TCATCTOTGA GrrrCCTTCAG CnTCTCACTC ATA£?rCAraA TCCTTCAGAG 1980 

GGAATATCCA CTGGCGAC3TT TAAACTAAGG GCTATQATAT TTGATOGrTCC CAAftCTACGG 2040 

CAGCTGCAAA AAGTftGTCGA AGGAAATTGT CTACGTGrTCT TOGAAAAAIT AffTTAGGAAr 2100 

TTGGATOGGT AAAAGGTACC CTTGCCITAC TCCATCTTAT TrTCTTAGCC CCCTrTCA£?r 2160 

GimAACTG C?mCATGTC CTAGTAGGAA C3TQCATTCTC CATCCTCATC CTCTCCCCIC 2220

CCAGGAAGTC AGrrGATTGTC TrTTTGGGCT TCCCCTOCAA AGGftCCTTCT GCAC?roC5AAG 2280 

TGCCACATCC AGrTCTTlTC TnTCTTGCT GCTGTGTTTA GATAATTGAA GAGATCTTTG 2340 

TQCCACACAG GATTmTTT TXTTTTAAGA AAAACCTATA GAIX3AAAAAT 'PACTAATGAA 2400 

ACTGTGTGTA CXTPGTCTGrrG CGTGCAACAT AAAAATACAG TAGCACCTAA GGAGCTIGAA 2460 

TCTTQGTTCC TGTAAAATIT CAAATTGATG TQGTATTAAT AAAAAAAAAA AAAACAMAAA 2520

AAAAAAAAAA AAAAGGGOGG CCX3CTCTAGA GGATCCAAGC TTACGTACGC GTCCAIGCGA 2580 

CX3TCCATAOC TCTTTCTATA GGC3GTCCCCC AAATTCCATT CANCGGGCCG TCGCTmTAN 2640 

AAAGGTCGTG ANTGGGGGAA ANCC 2664 



(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

CGTQQGAGTG AGGTEACCAGA TTCAGCCCAT TTOGCCCCGA CGCCTCTKTT CTCGGAATCC 60 

GQCnXXrPGCG GATTGAGGTC CCGGTTOCTA ACGGTGGGAT CGGTGTCCTC GQGATGAGAT 120 

TTGGOGTTTC CTCGGGGCTT TGGTC3GGATC GGTCJICCTCA GGATGAGATT TAGGGTTTCC 180

TCGGGGCTTT CGGGATCTTC ACCTAATATC OGGACTGCAA GATQGAGGAA GGCGGGAACC 240 

TAGGAGGCCT GATTAARATG CJTCCATCTAC TGGTCTTGTC AGGTGCCTGSG GGCAOXXAAA 300 

TGTOGGTGAC CITCGTCTCA C5GCTPCCTGC TTTTCCXSEAAG CCTTCCCCGA CATACCTTCG 360

GACTAGTOCA GfiGCAAACTC TTCCCCTTCT ACTTCCACAT CTCCATQGGC TGTGCCTrCA 420 

TCAACCTCTG CATCTTQGCT TCACAGCATG CTTGGGCTCA GCTCACATTC TGGGAGGCCA 480 

GCCAGCTTTA CXTTQCTOTTC CTGAGCCTTA CGCTQGCCAC TGTCAACGCC OGCTGGCTQG 540 

AACCCCQCAC CACAGCTGCC ATGTOGGCCC TGCTW^ACCCT GGAGAAGGAG OGAQGCCTQG 600 

GrrOGGGAGGT ACCADGCAGC CACCAGGGTC OCGATCCCTA CXXXXaVGCTG CX»GAGAAGG 660 

ACCCXaAGTA CAGTOCTCTC CGCCAGAATT TCTTCCGCTA CCATC3GC3CTG TCCTCTCITr 720 

GCAATCTQGG CIGCGTCCTG AGCAATGGGC TCPGTCTCGC TGQCCTTCCC CTGGAAATAA 780

GGAGCCTCTA GCOTQGGCCC TGCATGCTAA TAAATCCTTC TTCAGAAAAA AAAAAAAAAA 840 

AAACTCGAGG GGC5GCCXXX5T AOCCA 865 




(2) INFORMATION FOR SEQ ID NO: 251:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

TGGGGGQGGN AATGGCSTGrrC TGGCTCANOG ATTGCCNAAT CTGGAAATTC TCCATAACTT 60 

GCTAGCTTGT TTTTTTTTrr TTTTTTTACA CCCCCCOGCC CCACCCCCGG ACTTGCACAA 120 

TGTTCAATGA TCTCAGCAGA GTTCTTCATG TGAAACGTTG ATCACCTTTG AAGCCTGCAT 180 

CATTCACATA TTTTTTCTTC TTCTTCCCCT TCAGTTCATG AACTGGrTGTT CATTTTCTGT 240

GTGTGTGTGT GnTTATTTT GTTTQGATTT TTTTTTTTAA TTTTACTnT AGAGCTTGCT 300 

GTGTTGCCCA CCTTmTCC AACCTCCACC CTCACTCCTT CTCAACCCAT CTCTTCOGAG 360 

ATGAAAGAAA AAAAAAAGCA AAGTTTTTTT TTCTTCTCCT GAGTTCTTCA TGTGAGATTG 420 

AGCTTC3CAAA GGAAAAAAAA ATGTGAAATG TTATAGACTT GCAGCGTGCC GAGTTCCATC 480 

GGGTTTTTTT TTTAGCATTG TTATGCTAAA ATAGAGAAAA AAATGCTCAT GAACCTTCCA 540

CAATCAAGCC TGCATCAACC TTCTGGGTGT GACTTGTGAG TTTTGGCCTr GrTGATQCCAA 600 

ATCTGAGAGT TTACTCTGCC ATTAAAAAAA CTCATTCTCA TCTCATGCAT TATTATGCTT 660 

GCTACTTTGT CTTAGCAACA ATGAACTATA ACTGriTTCAA AGACTTTATCS GAAAAGAGAC 720 

ATTATATTAA TAAAAAAAAA AAGCCTOCAT GCTGGACATG TATQGTATAA OTATTTTrTC 780 

CTlTmTlT CCrrrTGGCT TOSAAATGGA CGTTCGAAGA CTTATAGCAT GGCATTCATA 840



WO 98/39448 PCT/US98/04493



CTTTTOrrrr ATTGCCTCAT GACTTTTTTG AffPTTAGAAC AAAACAGTOC AAC0C3TAGAG 900 

CCTTCITCCC ATGAAATTTT GCATCTGCTC CAAAACTGCT TKaCTTACT CAGAACTTCA 960 

AOCTCCCAAT GCACTGAAGG CftTTCCTTGT GCAAAGATAC CAGAATOGGT •EACACATTTA 1020 

ACCTGGCAAA CATTC3AAGAA CTCTTRATCT TTrCTTTITA ATAAGAATtSA OGCCCCACTT 1080 

•TOGGGACTAA AATTGTOCTA TTGCXX3AGAA GCAC3TCTAAA ATTTATTITr TAAAAAGAGA 1140 

AACTGCOCCA TTATmrOG •mCTTTTAT TrTTATrTTA TArrTTTTOG CTTTTOCTCA 1200 

TTGTCAAATC TGGAATGCTC TOGGrnTCTA GTATATAATT TAATTCTAGT TnTATAATC 1260 

TGTTAGCCXIA GTTAAAATGT ATGCTACAGA TAAAQGAATG TTATAGATAA ATTTCAAAGA 1320 

GTTAGGTCTG TTTAGCTGTA GAaTTTTTAA ACGA3TCATG CACTAAATTC TTTACTATIG 1380 

TGATOTTAAG GGGQGTAGAG TTTGCAAGGG GACTGTTTAA AAAAAGTAGC TTATACAGCA 1440 

TGrrGCTTQCA ACTTAAATAT AAGTTGQCSTA TGTCrAGrTCT TTGCTATACC ACTXSACICTA 1500 

TTGAAAACCA AAGTATTAAG AGGGGAAACX; CXXXTOirTA TATCTGTAGG QGTAnTTAC 1560 

ATTCAAAAAT GTATGTTTTT TTTTCTTTTC AAAAITAAAG TATTTGGGAC TCAATTOCAC 1620 

TAAGATATAA CCTGCAAGCA TATAATACAA AAAAAAATTG CAAAACTGOT TAGAACGCTA 1680 

ATAAAATTTA TQCAGTTATA AAAATGGCAT TACTGCACAG rTTTAAGATC ATOCAGAnr 1740 

TTTTACAGT-T GTATTGTOGT GCAGAACTGG ATTTTCTGTA ACTTAAAAAA AAA1CCACAG 1800 

TTTTAAAGGC AATAATCAGT AAATGTTATT TTCAGGGACT GACATCCTGT CTTTAAAAAG 1860 

AAATGAAAAG TAAATCTTAC CACAATAAAT ATAAAAAAAT CTTGrPCAGTT ACTTITCTTr 1920 

ITtfM-ATTTT GCTGTCCAAA ATTGrmTAT ATCTTCAGTT ACTAACTAAC CACGOJiCTT 1980 

GTTCCTATGT GCTrTTCTTT CATTTTCAAT TCTGGTTATA TCAAGAAAAG AATAATCTAC 2040 

AATAATAAAC GGCATTTTTT TTTGAAAAAA AAAAAAAAAA AA 2082 
(2) INFORMATION FOR SEQ ID NO: 252:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1482 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:

CAGGCAGGCT GGCCCCGGGG ACTTCTCTCT GGCCCTGCTC octgcgagcg cicogccctt 60

GCCXX3CCTGG (XCCTACGGA GTCCTTAGCC AGGATGGAGG CTGTTGrTGAA CTTOTACCAA 120

GAGGTCATGA AGCACGCAGA TOCCOSGATC CAQOGCTACC CTCTGATGGG GTCCCCCTTC 180

CTAATGACCT OCATTCTCCT GACCTAOTrG TACTrOGTTC TCTCACTTOG GCCTCGCATC 240 

ATGC3CTAATC QGAAGCCCTT CCAGCTOCGT GQCTTCATGA TTOTCTACAA CPiercACIG 300 

GrrOGCACTCT COCTCTACAT TGrPCTATCAG ITCCTGATOT OGGGCTO3CT GAGCACCTAT 360 

AOCTC3GOQCT GTCACCCTrGT GGACTATPCC AACAGCCCTG AGGCACTTAG GATOCTTCGG 420 

C3TGGCCTOGC TCTTCCTCTT CTCCAAGTTC ATTGAGCTGA TGGACACAGT GATCTTTATT 480 

CTCCGAAAGA AAGACGOGCA GGTGACCTTC CTACATGrrCT TCCATCACTC TOTOCTTCCC 540 

TOGAGCTQGT GGTOGGGGGT AAAGATTGCC COGGGAGGAA TGCGCICTTT CCATCCCAIG 600 

ATAAACTCTT CXX3TGCATGT CATAATOTAC CTGTACTACG GATTATCTOC CPTTOGCXXT 660 

GltSGCACAAC CCTACCTTPG GTOGAAAAAG CACATGACAG CCATTCAGCT GATCCACTTT 720 

GTCCTQGTCT CACTGCACAT CTCCCAGTAC TACTTTATGr CCAGCTCTAA CTACCAGTAC 780 

CCAGTCATTA TTCACXTTCAT CTGGATOTAT GGCACCATCT OCTTCATQCT CTTCTCCAAC 840 

ITCTQGTATC ACTCTTATAC CAAGGGCAAG CX3GCTGCCCC GTGCACITCA GCAAAATOGA 900 

QCTCCAQGTA TTDCCAAGGT CAAGC3CCAAC TGAGAAGCAT GGCCTAGATA GGCGCCCACC 960 

TAAGTGCCTC AGGACTGCAC CTTAGGGCAG TGTCCX3TCAG TGCXXTTCTCC ACCTACACCT 1020 

GTGAOCAAGG CTTATGTGGT CAGGACTGAG CAQGGGACTG GCCCTCCCCT CCCCACAGCT 1080 

QCTCTACAGG GACCACGGCT TTGGTTCCTC ACCCACTTCC CCCGGGCAQC TCCAGGGATC 1140 

TQGCCTCATT GCTGTCTGCC ACTCCAGAGC TGGGGGCTAA AAGQGCTOTA CAGnTATTTC 1200 

CCJCCTCCCTG CCITAAAACT TGGGAGAGGA GCACTCAGGG CTGGCCCrAC AAAGGGTCTC 1260 

GTGGOCTTTT TOCTCACACA GAAGAGGTCA GCAATAATGrr CACTGTOGAC CCACTCTCAC 1320 

TCCTOCACCC CACACACTQA AGCAGTAGCT TCTGGGCCAA AGGTCAGGGT GGGCGGGGGC 1380 

CTQGGAATAC AGCCTGTGGA GGCTGCTTAC TCAACTTGTG TCITAAITAA AAGTOACAGA 1440 

GGAAACCAAA AAAAAAAAAA AAAAACTCGA GGGGGGCCCX3 TA 1482 



(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:



QC3CA0GAGCG CCGTTGCCCG CCTGQCCCCT ACGGAGIOT TAGCCAGGAT GGAGGCTtTTT 60

GTGAACrtTGT ACCAAGAGGT GATGAAQCAC GCAGATCCCX: GGATCCAGGG CTACCCTCTC 120

ATCGQC3TCCC CdTOCTAAT GACCTCGATT CTCCTCACCT AaTICTACTr CGTTCTCTCA 180

CTTGGGCCTC GCATCATGGC TAATCGGAAG OCCTTCCAGC TCCGTGGCTT CATCAITOK: 240

TACAACTTCT CACTCGTOGC ACTCTOXTC TACATTGTCT ATCAGTTCCr GATOTCGQGC 300

TQGCTGAGCA CCTATACCTG GOGCTOTCAC CCTCAQGACT GCACCTTAGG GOCTXircCG 360

TCAGTOCCCT CTCCflMCTAC AOCTGTGACC AAGGCTTATG TGGTCAGGAC TGAGCAGGGG 420

ACrOGCCCTC CCCTCCCCAC AGCTGCTCTA CAGG6ACCAC GGCTITOGrr CXHCACCCAC 480

TTCCCOOGGG CAGCTCCAGG GATGTOGCCT CATTOCTCTC TGCCACTCCA GAGC3X5GGGG 540

CTAAAAOGGC TGTACAGTTA TTTOCCCTTC CCTGCCTTAA AACTTCGGAG AGGAGCACTC 600

AGGGCPGGCC CCACAAAGGG TCTCGTGGCC TTTTTCCTCA CACAGAAGAG CTCAGCAATA 660

ATCTCACTGT GGACCCAGTC TCACTCCTCC ACCCCACACA CTCAAGCAGT AGCITCTOGG 720 

CCAAAGGTCA GGGTGGGOSG GGGCCTGGGA ATACAGCCTG TGGAGGCTOC TTACTCAACT 780 

TGrPGlCTrAA TTAAAACTGA CAGAGGAAAC CAOGAAAAAA AAAAAAAAAA AAAA 834 



(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

1TCAACTTTT AAAATTTTAG ATCAGCAAAC TCTAAGATCC TAGAATOGAA GCTOTTCCTC 60 

ATTTCTCCAT GCTCACCCTC CCAGGTCAGC GAGATQGTGA AGAAGCIGCA CGCGGCAACA 120 

CCACCAACGT TOGGAGTOGA CCTCATCAAT GAGCnCTGG AGAACTTK3G CAGATCTCCC 180

AAGTGGTCTG GTCGGCAAGC CTITCTCTrr GTCTGCCAGA GrGTCATTCA GGA3GACIGC 240 



CTTCCCATGG ACCAGTTTGC TGTGCATCTC ATGCCGCATC TGCTAACCTT AGCAAATCAC 300

AGGGTTCCTA AOGTQOGAGT GCTGCTPGCA AAGACATTAA GACAAACTCT ACTAGAAAAA 360

GACTATrrcr TQGCCTCTGC CAGCTGCCAC CAGGAGGCTG TQGAGCAGAC CATCA3GGCT 420

CTTCAGATGG ACCGTGACAG CGATGTCAAG TATTTTGCAA GCATCCACCC TGCCAGTACC 480



AAAATCTCCG AAGATGCCAT GAGCACAGCG TCCTCAACCT ACTAGAAGGC TTOAATCTCG 540 
C?rcTCITrCC TGCTICCATG AGAGCCGAGG ITCAGTGGGC ATTCGCCACG CATOTGACCT 600 
OC3GATAGCrr TOXKSGGACSG AGAGACCITC CTCTCCTGCG GACTTCATTC CAGGTOCAAG 660

TTOCCTACAC CXMTACCAG GGftTTTCAAG AGTCAAGAGA AAGTACAGTA AACACTATTA 720 

TCTTATCTTG ACTTTAAGGG GAAATAATTT CTCAGAOGAT TATAATTGTC ACCGAAGCCT 780 

TAAATCCTTC TtTICrTCCTG ACTGAATGAA ACTTGAATTG GCAGAGCATT TTCCTTATCG 840 

AAQGGATGAG ATTOCCAGAG ACCTGCATTG CTril--ll XriU GrrTTTATTTA ACAATO5ACA 900 

AATQAAATTC TTACAGCCTG AAGGCAGACG TGTGCCCAGA TGTCAAAGAG ACCTTCACTTA 960 

TCAGCCCTAA CTCTTCTCTC CCAGQAAGGA CTTOCTTOGGC TCTGTOGCCA GCTCTCCAGC 1020 

CCAGCXXTGT GTGrTGAATCG TTTGTGACGT GTGCAAATGG GAAAGGAGGG CnTTTTACAT 1080 

CTCCTAAAGG ACCTGATGCC AACACAAGTA GGATTGACTT AAACTCTTAA GCGCAGCATA 1140 

TTCCrcrTACA CATOTACAGA ATGGTTGCTG AGTGTCTGTG TCTGATTTTT TCATCCTCGT 1200 

CATGACCTGA AGGAAATTTA TTAGAOSTAT AATGTATGTC TGGTGTTTTT AACTTCATCA 1260 

TGATCAGCTC TGAGGTGCAA CTTCTTCACA TACTGTACAT ACCTGTCACC ACTCITOGGA 1320 

GTGCTGCAGT CTTTAATCAT GCTGTTTAAA CTGTTOTGGC ACAAGTTCTC TTGTCCAAAT 1380 

AAAA-PTTATT AATAAGATCT ATAGAGAGAG ATATATACAC TTTTGATTGT TTTCTAGATC 1440 

TCTACCAATA AATGCAATTT GrTGACCTOTA TTAAAAAAAA NTAAAAAAAC TCGAGGQGGG 1500 

CCCGOTAC 1508 
(2) INFORMATION FOR SEQ ID NO: 255:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2514 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:

GAGAGACTCA CACTTCTTTT CCATTATCAC TGACGATGTA GTQGACATAG CAGGGGAAGA 60 

GCACCTACCT GTGrPTGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 

AGGCTTOCTG CCITATGAAG CCGATGCA6A AATTTrGGCT GTGAAATTTC ACACTATCAT 180 

AACTGAGAAG TGGQGATTAA ATATQGAGTA TTGTCCJTGGC CAGQCTTACA TTGWCTCTAG 240 

TOGAmrCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TTAGAGAAAT ATCCCCAAGC 300 

TATCTACACA CTCTGCTCTT CCTGrTGCCTT AAATATGTQG TTGGCAAAAT CAGTACCTGT 360 

TATGGGAGTA TCTC3TTGCAT TAGGAACAAT TGAGGAAGTT TGTTCTTTTT TCCATCGATC 420 

ACCACAACTG CTTTT7WGAAC TTGACAACGT AATTTCTGTT CTTTTTCAGA ACAGTAAAGA 480 



AflGQGGrrAAA GAACTQAAGG AAATCTGCCA TTCTCAGTOG ACAGGCAGGC ATGATOCTTr 540 

TGAAATTrrA GTOGAfiCTCC TGCAAGCACT TGTrrTATGT TTAGATGGTA TAAATAGTGA 600

CACAAATATT AGATOGAATA ACTATATAGC TQGCCGAGCA TrTGTACTCT GCAGTGCAGT 660 

GTCAGATTTT GATTTCATTG TTACTASTGT TGTTCTTAAA AATGTCCTAT CTTTTACAAG 720 

AGCCTTTQQG AAAAACCTCC AGGGOCAAAC CTCTGATGTC TTCTTTGOGG CCGGTAGCTT 780

GACTGCAGTA CTGCATTCAC TCAACGAAGT GATTGGAAAA •EATTGAAGTT TATCATGAAT 840 

TrTGGTTTGA GGAAGCCACA AATTTGGCAA CCAAACTTGA TATTCAAATC AAACTCOCTG 900 

GGAAATTCCG CAGAGCTCAC CAGGGTAACT TGGAATCTCA GCTAACCTCT GAGAGTTACT 960 

ATAAAGAAAC CCTAAGTGTC CCAACAGTGG AGCACATTAT TCAGGAACTT AAAGATATAT 1020 

TCTCAGAACA GCACCTCAAA GCTCTTAAAT GCTTATCTCT GGTACCCPCA GTCATGGGAC 1080

AACTCAAATT CAATAOGTCG GAGGAACACC ATGCTGACAT GTATAGAAGT GACTTACCCA 1140 

ATCCTGACAC GCTCTCAGCT GAGCTTCATT GriTOGAGAAT CAAATGGAAA CACAGGGGGA 1200 

AAGATATAGA GCTTTOCXSPCC ACCATCTATG AAGCXTTCCA OCTGCCTGAC ATCAAGTrTT 1260 

TTCCTAATCT GrTATGCATTG CTGAAGCTTCC TGTGTATTCT TCCTGTGATG AAGGTTGAGA 1320 

ATCAGOGGTA TGAAAATGGA CGAAAGCGTC TTAAAGCATA TTTGAGGAAC ACTTTGACAG 1380

ACXIAAAGGTC AAGTAACTTG GCITTGCTTA ACATAAATTT TGATATAAAA CACGACCTGG 1440 

ATTTAATGGT GGACACATAT ATTAAACTCT ATACAAGTAA GTCAGAGCTT CXTTACAGATA 1500 

ATICCGAAAC TGTGGAAAAT ACCTAAGAGA CTTTTAAAAA TAGGCTTTCT TATATTTGAT 1560 

ATTTGGAAGA AAAAGCXX5TA AGrTGTATGTA GACTACTTAA TCACTAAA3A TCTITGCCTA 1620 

TAGGACTCCA TTGAATACAT TAGCXMTGA TAATCTACCT GTrCAAATGG CCCCTGriTTG 1680

AACTCTCAAG CTTTGAAGAC CTAOCTOTTC TTCCAGAAGA GAACGTTGAA AGTOCXMCT 1740 

TTCCTTTTGC GTGATCTCTG TTGATGGCAC TCTGGAATTG TITCAGTITAA GTCATTTTAG 1800 

ACATAGCATT TATIATCACT GTGGATCTCT ACTTGTTCGG TGTTATGAAT TCTTTGAAGA 1860 

AATATATTrr GAAGAGGTGT GGGAGGAAGG AATACATTTT ATAAAATGnT GTACSTGAAGC 1920 

CXIACAATTGA CLTl'lUACTA ATAGGAGTTT TAMTATOTT AAAAATCTAT ACTGGACAGT 1980

TACAAGAAAT TACCGGAGAA AAGCTTGrTGA GCTCACCAAA CAAGGATTTC AGrTCTAGATT 2040 

TTGTCTTTCT TGAACTTAAA GAAACAAATG ACAAAGTTTG AATGGAAAAG CCTGCTGrTTG 2100 

TTCCACATCT CGnTOCTGTT TACATTCCTT TGTQGAGCCT ACATCTTCCT AAGCTTTTTA 2160 

GCAGGTATAT GTTGAACACT TCTGTTTCAT GGTPGAGACA GAATCAGAGG CCATQGATAC 2220 



TGACAACTGA TTTGTCTGTT TTTTTTCTCT GrPCTTTTTCC ATGACTCTTA TATACTGCCT 2280

CATCTTGATT TATAAGCAAA AOCTGGAAAA CXTTftCAAAAT AAGTCTTGTG GTTTATCTAG 2340

AAAAATATGG AAAATATTGC TGrTAHTTTT GGriGAAGAAA OTCAATTTTG TATACnTTAT 2400

TTCAATCTAA ATAAAATGTO AATTTTGrrPT AAAGCTTAGG CACAOTATTT TTTGTC3GQGT 2460

CAAAACAITC TICTGTAAAT TCTCTTAAAC ATTTGATAAA CAGCTTCACA ATTC 2514
(2) INFORMATION FOR SEQ ID NO: 256:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2357 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

CTGCCTTATG AAGCCGATGC AGAAATITTG GCTCrTGAAAT TTCACACTAT GATAACTGAG 60 

AAGTGGGGAT TAAATATGQA GTATTOTCGT GGCCAQGCTT ACATTGTCTC TAGTC3GATTT 120 

TCTTCCAAAA TGAAAGTICT TQCTTCTAGA CTTTTAGAGA AATAITCCCCA AGCTATCTAC 180 

ACACTCTGCT CTTCCTGrrGC CITAAATATG TQGTTGGCAA AATCAGTACC TGTTATGQGA 240 

GTATCTGTTG CATTAGGAAC AATTGAGGAA GT lTOl 'il.''l' l ' TTTTCCATCG ATCACCACAA 300 

CTGCTTTTAG AACTTGACAA OGTAATTYCT GTTCTTTTTC AGAACAGTAA AGAAAGGQGT 360 

AAAGAACTGA AGGAAATCTG CCATTCTCAG TGGACAGGCA GGCATGATGC TTTTGAAATT 420 

TTAGTOGAAC TCCTQCAAGC ACTTGTTTTA TOTTTAGATG GTATAAAIAG TGACACAAAT 480 

ATTAGATGGA ATAACTATAT AGCTQGCCGA GCATTTGrTAC TCTGCAGrTGC AGTGTCAGAT 540 

TTTGATTTCA TTGTTACTAT TGrrTGTTCTT AAAAATGTCC TATCTTTTAC AAGAGCCrTT 600 

GQGAAAAACC TCCAGGGGCA AACCTCTGAT GrPCTTCTTTG CGGCCQGTAG CTTGACTGCA 660 

GTACTGCATT CACTCAACGA AGrPGANTGGA AAATATTGAA CTTTATCATG AATTTTOGnT 720 

TGAGGAAGCC ACAAATTTGG CAACCAAACT TGATATTCAA ATGAAACTOC CTGGGAAATT 780 

CCGCAGAGCT CACCAGGC3TA ACTTQGAATC TCAGCTAACC TCTGAGAGTT ACTATAAAGA 840 

AACCCTAAGT GTCCCAACAG TQC3AGCACAT TATTCAGGAA CTTAAAGATA TATTCTCAGA 900 

ACAGCAOCTC AAAGCTCTTA AATGCTTATC TCTGGTACCC TCAGTCAirGG GACAACTCAA 960 

ATTCAATACG TCQGAGGAAC AOCATGCT6A CATGTATAGA AGTGACTTAC CCAATCCTGA 1020 

CACGCICTCA GCTGAGCTTC ATTCrTGGAG AATCAAATGG AAACACAGGG GGAAAGATAT 1080 

AGAGCTTCCG TCCACCAOCT ATGAAGCCCT CCACCTGCCT GACATCAAGT TTTTTCCTAA 1140 
TGICTATGCA TTGCTGAAGG TCXrrGTGTAT 'iVl'lU-'lViXJ ATGAftOGlTC i«3MTGAG0G 1200

GTATGAAAAT GGACGAAAGC GTCTTAAflGC ATATTTGAGG AACACTTTGA CAGACCAAAG 1260 

GTCAAGTAAC TTQGCTTTGC TTAACATAAA TTTTGATATA AAACACGACC TGGATTTAAT 1320 

GGTGGACACA TATATTAAAC TCTATACAAG TAAGTCAGAG CTTCCTACAG ATAATTCCGA 1380 

AACTGTGGAA AATACCTAAG AGACTITTAA AAATAGGCTT TCTTATATTT GATATTTC3GA 1440 

AGAAAAAGCC GrTAAGTOTAT GTAGACCACT TAATCACTAA ATATCTTTGC CTATAGGACT 1500 

CCATTGAATA CATTAGCCAT TGATAATCTA OCTGTTTAAA TGOCOCCTCT TTGAACTCTC 1560 

AAGCTTTGAA GAOCEACCTG TTCTTCCAGA AGAGAAOCnT GAAAGTGCCA TGmCCnT 1620 

TGCGTCATCT CTGTTGATGG CACTCTGGAA TTGTTTCAGT TAAGTCATTT TAGACATAGC 1680 

ATTTATTATC ACTGTOGATC TCTACTTGTT GGGTOTTATG AATTCTTTGA AGAAATATAT 1740 

TTTGAAGAQG TGrTQGGAGGA AGGAATACAT TTTATAAAAT GnTGTAGfTGA AGCCCACAAT 1800 

TGACCTTTTGA CCAATAGGAG TTTTAAGTAT GrTTAAAAATC TATACTQGAC AGTTACAAGA 1860 

AATTACCGGA GAAAAGCTTG TGAGCTCACC AAACAAGGAT TTCAGTGTAG ATTTTGTCTT 1920 

TCTTGAACTT AAAGAAACAA ATGACAAAGT TTGAATQGAA AAGCCTGCTG TTGTTCCACA 1980 

TCTCGrrcCT GTTTACATTC CTTTGrrGGAG CCTACATCTT CXTTAAGCTTT TTAGCAGGTA 2040 

TATGTTGAAC ACTrCTGTTT CATGGTTGAG ACAGAATCAG AGGCCATC3GA TACTGACAAC 2100 

TGATTTGTCT GTTTrTTTTC TCTGTCTTTT TCXMGACTC TTATATACTG CCTCATCTTG 2160 

ATTTATAAGC AAAACCTGGA AAACCTACAA AATAAGTICTT GrTGGTTTATC TAGAAAAATA 2220 

TGGAAAATAT TGCTGTTATT TrTGGTGAAG AAAATCAATT TrGTATAGTT TATTTCAATC 2280 

TAAATAAAAT GTGAATTTTG TTTAAAGCTT AGGCACATTA TTrTTTGTQG QGTCAAAACA 2340 

TTCTTGrrGTA AATTCTC 2357 
(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:
ACTTTCTQGT GCAAAAAGAT GTTCAAGCCT TATTTTATAC TTGCCTQCCC CTTTCTCTTT 60
CATTTATPGG AGTGAGCTGC AGCTCTAAGA AGAOCTGITC TTTTGAATGG AGAGTAGCAT 120
CAGGAACCAG GATGTGGGTG CGAQGCGrFGC TCCTGGCTGT TGCAGATTGC TGCACCCGGG 180
AGCTCTTAGT GGACAGAGCT AGAGGATATG TQCACGTACT TCCATCTCTC TCTCTGTCTC
CXSmTTAGC CCAGCACCAC AGGGTAOTrT CXZASTTTTTC TCTCTTTCCA TAGCTGrTAAG 
GCCCTTTCTC GGAATGGTTC TCATTCTCCT TAATCTATTA TTGGGTC A gr TTTCCTQCAT 
GTCCCX:AGCC TCCCATCACT GCCACCCACT OCCCACAGAG AIXXXXrrGCT CATCOGACTG 
GGGCmCAC TCCCACACTG TGTACCCXrPC TTGTCTOGAC GCCCTOCTQC CAAAACCTTC 
AGCAAACAGC TTTOCAAATXS GAAGTTGrrCA CTGTCARGGS CTTTACAATC AGCAACAGCA 
AAATCTACAT GCTGCTGAGG GTCCTGCCTC ATTAAGATQC AATAAATATC TAAGTACATA 
AAAACAGCAA TAGAAGAAAC GTAATGCTTT ATTCTCAAAT ATCNATGTCT ACATAGAAAA 
GCCAAAATTA TTAAGAATAG TAAGGAATT 



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2377 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:
TCGACCCACG CC3TCCGOCGA TGTGATGATT OCTGCGTATT CCAAGAACCG GGCCTATGCC 
ATCTTCTTCA TAGTCTTCAC TGTGATAGGG GACQCCCCCG GCGCTGTGCT ATCCTGTCCC 
GGCCACCCTT GCGTTGGnTT TQCTGCTGTA CTQgPGGCGC CCCTGAOCGT GGCTGTCTCC 
TCTTGAAGGA AGCCTGTTTC TGATGAACCT GCTGACAGCC ATCATCTACA GTCAGTTCCG 
GGGCTACCTG ATGAAATCTC TCCAGACCTC GCTGrTTTCGG AGGOGGGrTGG GAACCCGGCT 
GCCTTTGAAG TCCTATOCTC CATQGTQGQG GAGQGAGGAG CCTTCCCTCA GGCAGTTGGG 
GTGAAQCCCC AGAACTTGCT GCAGGTQCTT CAGAAGGTCC AGCTQGACAG CTCCCACAGA 
CAGQCCATGA TGGAGAAGGT GCGTTCCTAT GGCAGTGTTC TGCTCTCAGC TGAGGAGTIT 
CAGAAGCTCT TCAACGAGCT TGACAGAAGT GTGGrTTAAAG AGCACCOGCC GAGGCCCGAG 
TACCAGTCTC CCTTTTCTGCA GAGOGNCCCA GrrTOCTCTTC GtXCACTNAC TACTTTGACT 
ACCrOGGGAA CCTCATOGCC CTGGCAAACC TGGTGTCCAT TTGCGTGTrC CTGGTGCTQG 
ATGCAGATGT TGCTGCCTGC TGAGCGTGAT GACTTCATCC TGGGGGGTCT CAACTGOGTC 
TTCATTGrPGT ACTACCTGTT QGAGATGCTG GCTCAAGGTC TTrTGCCCTG GGGCCTGCGA 
RGGTACYKKT CCTAACCCCA RCAAMGTGTT TTGAACGGGC TCCTCAMCGT TTGTCCTGGC 
TQGWWKKGSM GATCTCAACT CTGGCTGrTGT AOCXMTGCC ACACXXMGC TOGAGGCOQG 900

ANATQGTGGG CCTQCTGrTCG CTCTOQGACA TGACCCGCAT ACTGAACATG CTCATCGrTGT 960 

TCCGCTTCCT GCGTATCATC CXXZAGCATGA AGCXX3MX3GC OGrTGGTQGCC ACTACCGTCC 1020

TOGGCCTOGT GCAAAACATG CGrrGOGTTTG GCX3GGATCCT GGrTGGrPQGTC TACTAOG?rAT 1080 

TTGCCATCAT TQGGATCAAC TICTTTAGAG QCGTCATTGr GGCTCTTCCT GGAAACAGCA 1140 

GCCTGGCXXX TGCCAATAGG TCGGCGCCXTT GrGGGAGCTT CXy\GCAGCTG GAGTACTOGG 1200 

CX:AACAACTT CXSATGACTTT GCXSGCTGCCC TOGTCACTCT GTQGAACTTG ATGGrPGGTGA 1260 

ACAACTQGCA GCrrGTI'lVm GATGCATATC GGCGCTACTA AGGCCCGTGG TCCAAGAJTCT 1320

ATTTTGTATT GTQGTGGCTG GTGTCGrTCTG TCATCTGGGT CAACCICTTT CTGGCCCTGA 1380 

TTCTGGAGAA CTTCCTTCAC AAGTGGGACC CCCGCAGCCA CXTTGCAGCCC CTTGCTGGGA 1440 

CCCCAGAGGC CACCTACCAG ATGACTCJrGG AGCTCCTGTT CAGGGATATT CTGGAGGAGC 1500 

CCGGGGAGGA TGAGCTCACA GAGAGGCTGA GCCAGCACCC GCACCTGrPGG CTGTGCAGGT 1560 

GACGTCCQGG TCTGCXATCC CAGCAGGQGC QGCAGGAGAG AGAGGCTGGC ATAACACAGG 1620

TGCCCATCAT GGAAGAGGCG GCCATGCTGT GGCCAGCCAG GCAGGAAGAG ACCTTTCCTC 1680 

TGACOGACCA CTAAGCTGGG GACAGGAACC AAGTCCTTTG CX3TGTCGCCX: AACAACCATT 1740 

TACAGAACAG CTGCTGCTTGC TTCAGGGAGG CGCCX3TGCCC TCCGCTTTCT TTTATAQCTG 1800 

CTTCAGTGAG AATTCCCTTG TOGACTCCAC AGGGACCTTT CAGACAAAAA TGCAAGAAGC 1860 

AGCGGCCTCC CCTGTCCCCT GCAGCTTCCG TGGTGCCTTT GCTGCCQGCA GCCCTTGGGG 1920

ACCACAGGCC TGACCAGQGC CTGCACAGGT TAACCGTCAG ACTTCCGGGG CATTCAGCTG 1980 

QGAATGATAC TAATACCTCC GATTTTAGCX: CAGCACCACA QGGTACGTTC CACnTTTTAT 2040 

TTCTTTCCAT AGCTGTAAGG CCXTnTCTGG GAATGCTTAT CATTCTCCTT AATCTATTAT 2100 

TGGGTCAGTT TTCCTGCATG TCCCXMCCT CXXMCACT6 CXACCCACTC CCCACAGAGA 2160 

TGCCCTGCTC ATCXX3ACTGG GGCTTTGACT CCCACACTGT GTACCCCTCT TGTOTGGAOG 2220

CCCTQCTGCX: AAAACCTICA GCAAACAGCT TTCCAAATGG AAGTTGnCAC TGTCAGGGCC 2280 

TTTACAATCA GCAACAGCAA AATPCTACATG CTGCTGAGGG TCCTGCCTCA TTAAGATGCA 2340 

ATAAATATGT AAGTACATAA AAAAAAAAAA AAAAAAA 2377 



(2) INFORMATION FOR SEQ ID NO: 259:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1193 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:

TCTGOTOGCC GTCGCCCCGC CCCICOCCIT TGCCCGCTTCG QGaSGGACOT CCTGTgniCT 60 

ATTTOCAAGG ACTCCAAAGC GAGGCCGQGG ACTGAAGGTG TGGGTGTCGA GCCCTCTGQC 120 

AGAGGC3TTAA CXTGQGTCAA AUGCACQGAT TCTCACCTCG TACAGTTACG CTCTCCOQCX; 180 

GCACCTCCGC GAGGMXTTGA AC3TOCTGAGC GCTCAAGnTT GTCOGTAGTC GAGAGAAGGC 240 

CATQGAGGTG CC3G0CACCGG CAC0GCXX5AG CTTTCTCTGT AGAGCATTGT GCCTATITCC 300 

COGAGTCTTT GCTC5CCGAAG CTGrrGACTQC CXaVTTCGGAA GTCCTTGAGG AGCGTCAGAA 360 

GCX3GCTT0CC TACGTCCXAG AGCCCTATTA CCCGGAATCT GGATQQGACC GCCTCOGQGA 420 

GCTGTTTGGC AAAGACACAG TGAACACTAG TCTGAATGTA TACOGAAATA AAGATGCCTT 480 

AAGCCATTTT GTAATTGCAG GAQCTGTCAC GGGAAGTCTT TTTAGGATAA ACGTAGGCCT 540 

OCGTGGCTGG TGGCTGGTGG CATAATTOGA GCCTTCCTGG OCACRXTCr AGGAGQCX^TG 600 

CTGAOXSGCAT TTCAGAAGTA CTCTGGTGAG ACTGTTCAGG AAAGAAAACA GAAGGATCGA 660 

AAGGCACTCC ATGAGCTAAA ACTQGAAGAG TGGAAAGGCA GACTACAAGT TACTGAGCAC 720 

CTCCCTQAGA AAATTGAAAG TAGTTTACAG GAAGATGAAC CTGAGAATGA TGCTAAGAAA 780 

ATPGAAGCAC TGCTAAACCT TCCTAGAAAC OCTTCAGTAA TAGATAAACA AGACAAGGAC 840 

TGAAAGTGCT CTGAACTTGA AACTCACTGG AGAGCTQAAG GGAGCTGCCA TGTCCGATGA 900 

ATGCCAACAG ACAGGCCACT CTTTGGTCAG CCTGCTGACA AATTTAAGTC CTGGTACCTG 960 

TGGTQGCAGT GGCTTGCTCT TGrTCTTTTTC ' rmirmTr AACTAAGAAT GGGGCTGTTG 1020 

TACTCTCACT TTACTTATCC TTAAATTTAA ATACATACTT ATGTITGTAT TAATCTATCA 1080 

ATATATGCAT ACATGAATAT ATCCAOXAC CTAGATTTIA AGCAGTAAAT AAAACATTTC 1140 

GCAAAAGATT AAAGriTGAAT TTTACAGTTA AAAAAAAAAA AAAAAAAAAA AAA 1193 



(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1262 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:



GAAAAACCCA AAGATGCAGA CAATCTCTTT GAACATGAAT TGGGGGCTCT CAATATGGCT 
GCATTACTAC GAAAAGAAGA AAGAGCAAGT CTTCTri^GTA ATCTTGGCCX: ATGnTGrTAAG 120

GOGTTGTCSCT TCfiGACGQGA TTCTQCAATT OGAAAGCAGC TTOTTAAAAA TQAGAAOGGC 180 

ACCATAAAAC AAGCTTACAC GAGTSCTCCA ATGGTAGACA ATGAATTACT TCGATTGAGT 240 

CTTCGGTTAT TTAAGCGGAA GACTACTTGC CATC3CTCCAG GACATGAAAA GACTQAAGAT 300 

AATAAACTTT CACAGTCCAG TATCCAACAG GAACTC3TGTG TGTCTTAAGA CCGAAGrTTCA 360 

ATATQGTATT TITOGTACTG TCTTCCTTCA GCAGTKX^ITA TTCTPITGCA AAGTTCTITC 420 

GITTGACAAG CATTAGTGAC AAAGQCAGAA AAGATTTATC AGCCATGCCA AAAGAGTCAA 480 

GAATTTTGAT CTTTAGAGAC ACTAGTTTTG GCX:AACTTAA GATTTTACGrT TAATTTTTAC 540

ATAGTA1TTG ACACTCATGC AAAATAATGT GAAAACATCT AGATTTAGTA GTTTATTCTG 600 

CGCCnTTGT TAA/U^CTGAA GATITPQGAA AATQGTTGTC ACTGCTCTTC CAGCCTATGA 660 

ATATTTTTGT GAAATQGAAC CATGGATTTA TGTCTC3GATC ATCCATACflG AACCAACAAT 720 

TTTATTCAAA AACAATGTOT TCATCAAACT AATTGCTCAC ATTGTGCAGT ACTATGnTGT 780 

ACAGACCACG TGAAAGGGAA TOCTGC5TCTA GCTGGCGTGG TATGrTTTATA GGCGAATTTC 840

AQCAGAAGGA AGCCAAAATA GrTTTTITCCT TTTGAAAGTT TTTTAAAAAT TATITCATGG 900 

GTCTTTTTTT TAATTAATAT GTGTCCATTG TTACAATGTA TGTTGGAaCT CrTITGACCC 960 

TAAATGCTTT TTTTGTTATC AGAGATTGTG TACTATTTTT ATTTTTAATA AATCTATCTT 1020 

CCCTTTCCTT GnTTAGATT TACTTTCCTC TTCGTTAATC TTATPCCTGA TGATCTASAA 1080 

CATTAGTCAT CAACATTACA TGrTTTCATGC TTCAGATATT rrACTGCTTG TGTCCTTATT 1140

GTTGGACAGC TTTAAACAGA GTTGATGGrTA CTTCAAATAT AGCTCATTGA TACTTAAGGG 1200 

CANCTTCCTT GQGATCTrGGG CTTTTTGGAA GGAAAAAAAT TOCCXXAAAG GCAAATCCXA 1260 

C3T 1262 
(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1179 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

GGCAAACTTT CCCOCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 

GGGTTC30C3K: GQCGCCCTC3G CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTK3GCCCT 120 

CAA0GC3CTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180
CTGGACCTCA TCTTCCTGCG CCGACCIGCG CQGGG3Utf3GG GAGTmCAGA CTGrTGAAGGA
CGTOGTQCTG GflCTQCCTGT TGGACTTCTT AOCOGAGGGG GTGAACAAAG ASAAGATCAC 
ACCACTCACG CTCAAGGAAG CTTATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 
CCGATGGAGT CTTATATCCC TGTCAAACAA CAGTGGCAAA AATGTGGAAC TGAAATTTGT 
GGATTCCCTC CGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 
TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCCACAAT 
AATCGGQGAG AGCGTCTATG GCGATTTCCA GGAAGCCTTT GATCACXnTT GTAACAAGAT 
CATTGCCACC AGGAACCCAG AGGAAATCCG AGGGGGAGGC CPGCTTAAGT ACTQCAACCT 
CTTOGrrGAGG GGCTTTAGQC CCXXTTCTGA TGAAATCAAG ACCXTTTCAAA GGTATATOTG 
TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TQGAGTCCTA 
TTTGCAGAAC CACTTTGTQG GATTGGAAGA CCGCAAGTAT GAGrHATCTCA TGACXXTTTCA 
TGQAGTC3GTA AATGAGAGCA CAGTGrPQCCT GATGGGACAT GAAAGAAGAC ASACTTTAAA 
CXTTTATCACC ATGCTGGCTA TCCGGGTGTT AGCTGACCAA AATGTCATTC CTAATGTGGC 
TAATGTCACT TGCTATTACC AGCCAGCCCC CTATGTAGCA GATGCCAACT TTAGCAATTA 
CTACATTGCA CAGGTTCAGC CAGTATTCAC G1X3CCAGCAA CAGACCTACT CCACTTGGCT 
ACCCTQCAAT TAAGAATCAT TTAAAAATGT CXTTGTGGGGA AGCCATTTCA GACAAGACAG 
GAGAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAGAGC 



(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1162 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:
GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 
GQGTTGOGNC QGCGCCCTGG CCCGAAGAAG CX3CAATTQGC GTTCCGOGAA CGTTGGCCCT 
CAACGGCTCG GCAGCCAGCX: ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 
CTGGACXirrCA TCTTCCTGCG CCGACCTGCG OGGGGAAGGG GAGTTTCAGA CTGrTGAAGGA 
CGTCGTGCTG GACTGCCTGT TGGACTTCTT ACCCGAOQGG GTTGAACAAAG AGAAGATCAC 



ACCACTCACG CTCAAGGAAG CTTATGTGCA GAAAATGGTT AAAGrTGTGCA ATGACTCTGA 
CCGATGGAGT CTTATATCCC TGTCAAACAA CAGTGGCAAA AATGTGGAAC TGAAAITTGT 420

GGftTTCCCTC CX3GAQGCAGT TTGAATTCAG TGTAGATTCT TTPCAAATCA AATTAGACTC 480 

TCITCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCCACAAT 540 

AATCGGGGAG AGCGTCTATG GCGATTTCCA GGAAGCCTTT GATCACCTTT GTAACAAGAT 600 

CATTGCCACC AGGAACCCAG AQGAAATCCG AGQGGGAGGC CTQCTTAAGT ACTGCAACCT 660 

CTK3GTGAGG GGCTTTAGGC COGCXTTCTGA TGAAATCAAG ACCCTICAAA GGTATATGTG 720 

TTCCAGGnT TTCMCGACT TCTCAGACAT TOGAGAGCAG CAGAGAAAAC TGGAGTCCTA 780 

TTTGCAGAAC CACTTrGTOG GATTQGAAGA COGCAAGTAT GAGTATCTCA TGACCCTTCA 840 

TQGAGTQGTA AATGAGAOCA CAGTGTGCCT GATOGGACAT GAAAGAAGAC AGACTTTAAA 900 

CXTTATCACC ATGCTGGCTA TCCGGGrTGnT AGCTGACCAA AATGrTCATTC CTAATGTGGC 960 

TAATGTCACT TGCTATTACC AGCCAGCCCX: CTATCTAGCA GATGCCAACT TTAGCAATTA 1020 

CTACATTGCA CAGGTTCAGC CAGTrATTCAC GTGCCAGCAA CAGACCTACT CXIACTTGGCT 1080 

ACCCTGCAAT TAAGAATCAT TTAAAAATGT CCTGTOGGGA AGCCATTTCA GACAAGACAG 1140 

GAGAGAAAAA NAANGAAAAG AG 1162 
(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 735 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

OGGGCTGGGT ATTTGCCTCG CACCATGGCG CCCAAGGGCA AAGTGGGCAC GAGAGGGAAG 60 

AAGCAGATAT TTGAAGAGAA CAGAGAGACT CTGAAGTTCT ACCTGCGGAT CATACTGGGG 120 

GOCAATGCCA TTTACTGCCT TGTGAOGTTG GTCTTCTTTT ACTCATCTQC CTCATTTTQG 180 

GCCTGGTTQG CCTTGGGCrr TAGTCTOGCA GTGTATQGGG CCAQCTACCA CTCTATGAGC 240 

TOGATGGCAC GAGCAGCGTT CTPCTGAGGA TGGGGCCCTG ATGGATGGTG GCACGAGCTC 300 

AACATGGAGC AGGGCATGGC AGAGCACCTT AAGGATGTGA TCCTACTGAC AGCCATCCrTG 360 

CAGGTGCTCA GCTQCTTCTC TCTCTATGTC TQGTCCTTCT GGCITCTOQC TCCAGGCCGG 420 

GCCCTTTACC TCXTGTQGGT GAATGTGCTG GGCCCCTGGrT TCACTGCAGA CAGTGGCACC 480 

CCAGCACCAG AGCACAATGA GAAACGGCAG CGCCGACAGG AGCQGCGGCA GATGAAGCGG 540 

TEATAGCCAT TGACATTGTG GCCACAGGCC ACTGGCOCTG GGTGGCTCTG TCAGQGTGCA 600 



CAGCXXXnCA TGCCIGGAGC AATGAGQGTC TAGTCCAGGG GCXIAAAAQCA GTCTGAGGTA 660 
TTGGGTATAC TTATACTCTA raOGGTrOCSTT GAATAAATQG CTTAGAATGT GAAAAAAAAA 720 
AAAAAAAAAA ATTTT 735 



(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 783 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:

AACJTGCATGA GCTQCCGATG TQGTQCTTAG TGATTQCX3GT TTCXaGTCGCT CTOCCGTGTT 60 

TCCCGQQCTG GGTArTTGCC TOGCACCATG GCGCCCAAGG GCAAAGTGQG CACGAGAGGG 120 

AAGAAGCfiGA TATTTGAAGA GAACAGAGAG ACTCTGAAGT TCTACCTQCX5 GATCATACTG 180

GGQGCCAATG CCATTTACTG CCTTGTGACG rPGGTCTTCT TTTACTCATC TGCCTCATTT 240 

TQQGOCTGCT TQGCCTGGGC TTTAGTCTQG CAGTGTATGG GGCCAGCTAC CACTCTATGA 300 

GCTCGATGGC ACGAGCAGCG TTCTCTGAGG ATGGQGCCXrr GATGGATQGT GGCATQGACC 360 

TCAACATOGA GCAGGGCATG GCAGAGTGAG TGTCXXXXAC CXXXAGCCXA GGCACCTTAA 420 

GGATGTGATC CTACTGACAG CCATCGTGCA GGTGCTCAGC TGCTTCTCTC TCTATGTCTG 480

GTCCTTCTGG CTTCIGGCTC CAQGCXXXSGC CCITTAOCTC CTGTQQGTGA ATGTGCTC3GG 540 

CX^CCTGGTTC ACTC3CAGACA GTGGCACCCC AGCACCAGAG CACAATGAGA AACGGCAGCG 600 

CXX5ACAGGAG OGGCGGCAGA TGAAGCGGTT ATAGCXiTVTTG NXATTTSGC SACNRGCCAC 660 

TGGCCCTCGG TQGCTCTGTC AGGGTTGCACA GCCCCTCATG CCTGGAGCAA TGAGGGTCTA 720 

GTCCflGGCSGC CAAAAGCAGT CTGAGGTATT GGGTATACTT ATACTCTATA GGGTCGTTGA 780

ATA 783 



(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1638 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:
GGCACGAGGC GGCGQCAGCX; GTOGOGCXXX; OGCCXTOCGG OGGGAGCCGrP NCXXOTTCOC 60

CyroOGGGAGC GCGGGGYCGG GGTOCAGGGG ANCCOGGGMC AOXAGAGCX; GGAAGAGGAT 120

GGATTGCCCG GCCCTCOCCC CCGGATGGAA GAAGGAGGAA GTGATCCGAA AATCTGGQCT 180 

AAGTGCTQGC AAGAGCGATC TCTACTACTT CAGTOCAACT GGTAfiGAAGT TCAGAAGCAA 240 

OXTCAGrrTG GCAAGCTACC TOC5GAAATAC TOTTGATCTC AGCAGmTC ACTTCftGAAC 300

TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTE30GAA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA TACAACATTG CCAATTAGAC AAACAGCAOC 420 

AATTTTCAAA CAACCGGTAA CXAAAGTCAC AAAJCATCCT AGTAATAAAG TGAAATCAGA 480 

CXXZACAACGA ATGAATGAAC AGCCACGTCA GCTTTTCTGG GAGAAGAGGC TACAAGGACT 540 

TAGTGCATCA GATGTrAACAG AACAAATTAT AAAAACCATO GAACTACCCA AAGGTICTTCA 600

AQGAGTTQGT CCAGGTAGCA ATGAHGAGAC CCTTTTA3CT GCTGTTQCCA GTOCTTTGCA 660 

CACAAGCTCT GOGCXaW^TCA CAGGGCAAGT CTCCX3CTGCT CrrOGAAAAGA ACCCTCCTOT 720 

rPQGCTTAAC ACATCTCAAC CCCTCTGCAA AQCTTTTATT GTCACAGATG AAGACATCAG 780 

GAAACAQGAA GAGCGAGTAC AGCAAGTACG CAAGAAATTG GAAGAAGCAC TGATGGCAGA 840 

CATCTTGTCG CGAGCTOCTG ATACAGAAGA GATGGATATT GAAATGGACA GTGGAGATGA 900

AGCCTAAGAA TATGATCAGG TAACTTTCGA CXX5ACTTTCC CCAAGAGAAA ATTCCTAGAA 960 

ATTQAACAAA AATOITIXJCA CTGGCTTTTG CCTGTAAGAA AAAAAATCTTA CCCGAGCACA 1020 

TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTrOAT GrATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CXTTAGGACTT AAAATTAGTC TTTTGrTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGQAAATG 1200

TAGCACTEAC GTAAAACATT TGTTTCCCCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 

CTTTATTGAA AGGGGACACC TGTACATTCT TCCATCRTCA CTGTAAAGAC AAATAAATGA 1380 

TTATATTCAC AGACTGATTG GAATTCTTTC TGnTGAAAAG CACACACAAT AAAGAACCCC 1440 

TCGTTAGCCT TCCTCTGATT TACATTCAAC TCTGATCCCG GGGCXTTTAGG TTTGACATGG 1500

GAGGTGGGAG GAAGATAGCG CATATATTIG CAGTATGAAC TATTGCCTCT GGGACGTTGT 1560 

GAGGAATTCT GCTTTCACCA GAATTTCTAA GGATrTCPGG CTTAAATATC ACCTAGCCTG 1620 

TQGTAATTTT TTTTCCCT 1638 



(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

CGTCCGrrACT GCCATGCAGG TACOGGGTCC GGAATTCCCA GQGTCGACCC ACGOGTOCGC 60 

TCfiGTTQGCA AOGTACCTC3G GAAATACTGT TGATCTCAGC AGmTGACT TCAGAACTQG 120 

AAAGATGATG CCTACTAAAT TACAGAAGAA CAAACAGAGA CTGCGAAAOG ATCCTCTCAA 180

TCAAAATAAG GGTAAACCAG ACTTGAATAC AACATTQCCA ATTAGACAAA CAGCATCAAT 240 

TTTCAAACAA CCGGTAACCA AACTCACAAA TCATCCTAGT AATAAAOTGA AATCAGACCC 300 

ACAAOGAATG AATGAACAGC CAOGTCAGCT TTTCTC3GGAG AAGAGGCTAC AAGGACTTAG 360 

TQCATCAGAT GTAACAGAAC AAATTATAAA AACCATGGAA CTACCCAAAG GTCTTCAAGG 420 

AGrrTGGTCCA GGTAGCAATG ATGAGACCCT TTTATCTGCT CTTGCCAGTG CTTTGCACAC 480

AAGCTCTC3CG CCAATCACAG GQCAAGTCTC CGCTGCTGTG GAAAAGAACC CTGCTGrTTTG 540 

GCTTAACACA TCTCAACCCC TCTGCAAAGC TTTTATTGTC ACAGATGAAG ACATCAGGAA 600 

ACAGGAAGAG CGAGrTACAGC AAGTACGCAA GAAATTQGAA GAAGCACTGA TGGCAGACAT 660 

CTTGTOGCGA GCTGCTGATA CAGAA6AGAT GGATATTGAA ATQGACAGTG GAGATGAAGC 720 

CTAAGAATAT GATCAGC3TAA CTTTCGACCG ACTTTCCCCA AGAGAAAA3T CCTAGAAATT 780

GAACAAAAAT GTTTCCACTG GCmTGCCT GTAAGAAAAA AAATGTACCC GAGCACATAG 840 

AGCTTTITAA TAGCACTAAC CAATGCCTTT TTAGATGTAT TTTTGATGTA TATATCTATT 900 

ATTCAAAAAA TCATGTITAT TTTGAGTCCT AGGACTTAAA ATTAGTCTTT TGTAATATCA 960 

AGCAGGACCC TAAGATGAAG CTGAGCTTIT GATGCCAGGT GCAATCTACT QGAAATGTEAG 1020 

CACTTACGTA AAACATTTGT TTCCCCCACA GTTTTAATAA GAACAGATCA GGAATTCTAA 1080

ATAAATTTCC CAGTTAAAGA TTATTGTGAC TTCACTGTAT ATAAACATAT TTTTATACTT 1140 

TATTGAAAGG GGACACCTGT ACATTCTTCC ATCRTCACTG TAAAGACAAA TAAATGATTA 1200 

TATTCACAGA CTGATTQGAA TTCTrTCTGT TGAAAAGCAC ACACAATAAA GAACCCCTCG 1260 

TTAGCCrrcC TCTGATTTAC A3TCAACTCT GATCCCGGGG CCTTAGGTTT GACATGGGAG 1320 

GTQGGAGGAA GATAGCGCAT ATATTTGCAG TATGAACTAT TGCCTCTGGG ACGTTGrrGAG 1380

GAATTGTGCT TTCACCAGAA TTTCTAAGGA TTTCTGGCTT AAATATCACC TAGCCTGrTGG 1440 

TAATTTTTTT TCCCT 1455 
(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1086 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

CGCCTGCAGrr AOCGGTCCGG AATTCCCGGG TCGACCCAOG OGTOGCTGAC CCAGGAGAAG 60 

CTGCCTGTCT ACATCAGCCT GGGCTGCAGC GCGCTGCCGC CGOGGGGCCG GCAGCTGAAC 120 

TATGTCXTTCT TCAGGGOGGG CACOnCTTG CATTCATCTT TGTACCCCCA GCATCTAGCA 180 

GTGTTC3GCAT GTAGTAGC5CA CTCAAGAAAT GTGTGTTGAA TGAAOGATGC CTGTC5ACAAG 240

CAAGCQGACT TTATTCTTTC CTGACCCTTG CTCCTATGAC ACACCTCCTC CTGACTGCCA 300 

CTGTCACTCC TTCAGAGCAG AACTCCTCTA GGGAACCTGG ATGQGAAACA GCCATQGCCA 360 

AGGACATCCT GQGrPGAAGCA GCXXTACACT TTGATGAACT GAACAAGCTG AGGG?rGTTGG 420 

ACCCAGAGGT TACCCAGCAG ACCATAGAGC TGAAGGAAGA GTGCAAAGAC TTTGTGGACA 480 

AAATTGGCCA GTTTCAGAAA ATAGTTGGTC GrTTTAATTGA GCTTGTTGAT CAACTTGCAA 540

AAGAAGCAGA AAATGAAAAG ATGAAGGCCA TCGGTCXTCG GAACTTQCTC AAATCTATAG 600 

CAAAGCAGAG AGAAGCTCAA CAGCAGCAAC TTCAAGCCCT AATAGCAGAA AAGAAAATQC 660 

AGCTAGAAAG GTATCGGGIT GAATATGAAG CTTTGTGTAA AGTAGAAGCA GAACAAAATG 720 

AATTTATTGA CCAATTTATT TTTCAGAAAT GAACPGAAAA TriXJULTri ' l ' ATACTAGGAA 780 

GGCAAAACAA AAAAAAGCCT CTCAAAACCA AAAAAACCTC TGTAGCATTC CAGCGGCTTG 840

ACCAATGACC TATGTCACAA GAGGTGGCGT GTAAGGAATG CAGCCCCCTG AAGACAGCAC 900 

TACAAGTCTG GGQGAGCCAG TTTTAACATC AGTQCACAGC TQCTCXTOGT GGCCCTGCAG 960 

TGTAOGTTCT CACCTCTTAT GCTTAGTTGG AACTAAGCA6 TTTGTAAACT TTCATCCTTT 1020 

TTTTTGTAAA TTCACAAAGC TTTGGAAGGA GARGCAATAA ATTTTrGKTT TCNAAATGGC 1080 

TTGATG 1086



(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

GGCACGQGAG CftGCOGGGCT GGnCCTGCTG CGAGCCGGCG GCCOGGAGTG GQGOQGOGGA 60

GCAAflCATGA AOGTTGGfiCT TOCCXACAGT GAAGTCAATC CAAATACOOG TGTCATGAAC 120 

AQCCGOGGTA TGnX3C5CTGAC ATATC3CATTG QGAGrTTOGCT TGCTTCATAT TGTClTftCTC 180 

AGCATTCCCT TCTTCAGrPGT TCCTGTPQCT TGGACTTTAA CAAATATTAT ACATAATCTG 240 

GGGATOTACG TATTTrTGCA TGCAGTCAAA GGAACACCTT TCGAAACTCC TGAOCAGGGT 300 

AAAAGCAAGG CTCCTAACTC ATTOC5GAACA ACTGQACTAT QGAGTACAC3T TTACATCTTC 360

AOGGAAGTTT TTCACAATTT CTCCAATAAT TCTAIATTTT CTGGCAAGnT TCTATAOGAA 420 

GTATGATCCA ACTCACTTCA TCCTAAACAC AGCTTCTCTC CTGAGTGTAC TAATTCCCAA 480 

AATGCCACAA CTACATGGTG TTCGGATCTT TQGAATTAAT AAGTATTGAA ATGTTTTGAA 540 

ACTGAAAAAA AATTTTACAG CTACTGAATT TCTTATAAGG AAGGAGIGGrr TAGTAAACTC 600 

CWrTGrrTTCT CTGATAAICT GAAATGAGAA CTATTTACAT TQGAGGGCCA ATGGCTQGTC 660

CTTCAACrrGC TOTTTTGAAG TGCAGATTTC CATTAAATGA TGCCTCTGTT TAATACACCT 720 

GGTACAITTC TGAASAGGGG CTTTATAAGC AGGCTOQGCA QGCXX:AGCTT ATAAGTTAAA 780 

GGGCATCACA GTGAGGGTGT AOTAGATAAA TTCAAGGAAA TAAGAGATTT GTAAGAAACT 840 

AC3GACCAGCT TAACTTATAA TGAATC3GGCA TTGTGTTAAG AAAAGAACAT TTCCAGTCAT 900 

TCAGCTGTGG TTATTTAAAG CAGACTTACA TGTAAACCX3G AATOCTCTCT ATACAACTTT 960

ATTAAAGATT ATTTTTA3TA CCX3TAAAAAA AAAAAAAAAA AAA 1003 





(2) INFORMATION FOR SEQ ID NO: 269: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1234 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

ATCAGCATCT ACAAGTAGCA TATTrTOGAT QCJlVlTiirit; TGCTACTTCA AAGTAACTAG 60 

GAAAAAATAA TCCTCGCAAC ACAGGTACCT TGTCATGTCA GAATTGGGGG TGTTAGGTTG 120 

CCAGTTGTAT CAGTGTTGAT TCATTTCATT ACTTCCTACA GAGCAAACAT GAAOGTTGGA 180 

GTTGCCCACA GTGAAGTGAA TCCAAATACC OGTGTCATGA ACAGCCGQGG TATGTCGCTG 240 

ACATATGCAT TGGCaGTTGG CTTGCTTCAT ATTGrCTTAC TCAGCATTCC CTTCTTCAGT 300

C?rrCCTGTTG CTTGGACTTT AACAAAIATT ATACATAATC TGGGGATOTA OGTATTTTTC 360

CATGCAGrTCA AAGGAACACC TTTOGAAACT CCTGACCAGG GTAAAGCAAG GCTCCTAACT 420

CATTGQGAAC AACTOGACTA TQGAGTACAG TTTACATCTT CAOOGAAGrTT TTTCACAATT 480 

TCTCCAATAA TTCTATATIT TCTGGCAACT TTCTATACGA AGTAJPGATCC AACTCACTTC 540 

ATCCTAAACA CfiGCTPCTCT CCTGAGTCTA CTAAOTOXA AAATCCCACA ACTACATOC?r 600

GTTCQGATCT TTGGAAITAA TAAGTATTGA AATCnTTTGA AACTGAAAAA AAATTTTACA 660 

GCTACTGAAT TTCTTATAAG GAAGGAGTOG TTACTAAACT GCACTGnTC TSTGATAA1G 720 

TGAAATGAGA ACTATTTACA TTQGAGGGCC AATQGCTQGT CCTTCAAGTC CTGTTTIGAA 780 

GTOCAGATTT CCATTAAATG ATGCCTCTGT TTAATACACC TOGTACATTT CTGAAGAGGG 840 

GCTTTATAAG CARGCTQGGC AGCXXXMCT TATAAGTTAA AGQGCATCAC AGTGAGGGIG 900

TACTAGATAA AITCAAGGAA ATAAGAGATT TGTAAGAAAC TAGGACCAGC TTAACTTATA 960 

ATGAATGQGC ATTGTGnTAA GAAAAGAACA TTTCCAGICA TTCAGCTCTC GTTATTTAAA 1020

GCAGACTTAC ATGTAAACCG GAATCCTCTC TATACAAGTT TATTAAAGAT TATTTTTATr 1080 

AOCirrACATA TTTCKCTTCJr TTTATGTAAG YGGATGTATA TCCTCTTGTT TTATACAAGC 1140 

CflGTTCCCAC TTATCAGCSGT ACrriTTTGG TTTTGCTGGG CTTAATATTG TGTATTOGTC 1200

AATGAGGCCA TTTTTACANT TATTAAOGTT ACAG 1234 

(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:

NGAGGTQCGT TCTGAGCCGT CTGTCCTGCG CCAAGATGCT TCAAAGTATT ATTAAAAACA 60 

TATGGATCCC CATGAAGCCC TACTACACCA AAGTTTACCA GGAGATTTQG ATAGGAATCG 120 

GGCTGATGGG CTTCATOGTT TATAAAATCC GGQCTGCTGA TAAAAGAAGT AAGGCTTTGA 180 

AAGCTTCAGC GOCTGCTCCT GGTCATCACT AACCAGATTT ACTTQGAGTA CATGTQAAAG 240 

AAAAOGTCAG TCTGCCTGTA AATTTCAGCA AGOCGTGTTA GATOGGGAGC GTGGAACGTC 300 

ACTGTACACT TGTATAAGTA CCGTTTACTT CATGCXATGA ATAAATGGAT CTGTGAGATG 360 

CACTQCTftCC TGGTACTGCT TTCAGTGTGT TCCCCCTCAG CCCTCXX3GCG TGTCAGGCAT 420 



ACTCTGAGTA GATAATTTGT CATGCAGCGC ATQCAATCAG AATCTCflCTG AGCCACCCAT 480

CATTGTGAAA TAATTACCTC AGTTGTACAG GACTTGGTGA TCAGGATCCA GGCACTCACT 540 

TGTA3TCTAC TQCTCAATAA ACGTITATTA AACT 574



(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1731 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:

GCTGCAAGGT GCQCCTCGTG CCQCTGCAGA TCCAGCTCAC TACCCTC3GGA AATCTTACAC 60

CTTCAAGCAC TGrGTTTTTC TCXTTGTGATA TGCAGGAAAG GrTTCAGACCA GCCATCAAGT 120 

ATTITOQGGA TATTATTAGC GTGGGACAGA GATTGTPQCA AGGGGCCCGG ATTTTAGGAA 180 

TTCCTOTEAT TGTAACAGAA CAATACCCTA AAGGTCTTQG GAGCACGGTT CAAGAAATTG 240 

ATTTAACAGG TGTAAAACTG GTACTTCCAA AGACCAAGTT TTCAATGGTA TTAOCAGAAG 300 

TAGAAGOGGC ATTAQCAGAG ATTCCOGQAG TCAGGAGrTGT TGTATTATTT GGAGTAGAAA 360

CTCATGTGTG CATCCAACAA ACTGCCCTGG AGCTAGTTGG CCGAGGAGTC GAGGTTCACA 420 

TTGTTGCTGA TGCCACCTCA TCAAGAAGCA TQATGGACAG GATCSTTTGCC CTCGAGOGTC 480 

TOGCTCRARC CNQGQATCAT AGTGACCACG AGrTGNAGGCT Grr i V i i;L'rii: AGCTGGTAGC 540 

TGATAAGGAC CATCCAAAAT TCAAGGAAAT TCAGAATCTA ATTAAGGCGA GrTGCTCCAGA 600 

GTCGGGTCTG CTTTCCAAAG TATAGGACAT TTGAAGAACT QGTATGCTAC TCACTGGTGA 660

AGGACAGTCA GGTGAAGGAC TGTAAGCCCA CACAAGCTCT TCTTATCTCT ACTASAATTA 720 

AAATGrnCAAG TCAAAAACGG CTCCTTTTTT GCGCCTCCTA GTGAACTTAA CCAGCTAGAC 780 

CATTTQAGTA CCAGCATTTA GTTACAAACG TCAAAGGCTT CCGGTGCTGC TTACCTTCCT 840 

TTTTTGTTAA TGTGCTTTTA TTTATTAAAA AAAATTACAA TGAAGATGCC TGTTTTGTCT 900 

CTACTGTOTA CTCTGATCGT ATCTITOCAA AGTGCAGACT CrTGTGAAGT TTTCTTAAAT 960

TGTTCACTTT AAAGAAAATG ACGTEACCAAC AATGATTTGG CTTTTATATT ACTGTAAGAT 1020 

GTTATAATGT TAATGTQGAT GTAGTGCTTT TACTTTACAG ATTGATTGGA ATAAGATTAT 1080 

TGCATATGAA TTTACCCACA GGACTCTGAA TCATCrTTACC CACTCCCCTC ACAATGrTTGT 1140 

CCAdTAGTG ACnTGCATTG ATCTATCCGT ACCAAATGAT GTTGAATAAT TACATATCTT 1200 

TCTKGACTAT ACTGATTTCT TATTTTGGTC ACTATTACTA AATCTCTGTT AATATTCTCT 1260

CTTTTAACTG AAAAGGGATG GGAIflGAAGG GTITOCAATG CCATATTATT GGTQGAGGGC 1320

TOTTTTAACA TCPTPGAAGT ATOQCTTOCT GAATATCITT ACCAACATCT TGAATATATA 1380

TTCTAGTCTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG GTGGAATATC ATTTTAAAAT 1440 

GnrCATGTTC TGTTCTATAT TXTCITCACC TACTCTCCAA ATATTGIAAT QCAAAAAIJIC 1500 

TCAGTAATCA TTTGGTAGTA TTAATTTTGT GGTCATTGTT TCTCTrCGAT AAATTTATTT 1560

TCATTAAATA CTTRTTAGAG QGTTTTGAAA TGrTTTTTCAA ATATGTGAAA TGTGAAflCTC 1620 

CTOPCTTTTA TATTAAAC3TA ATTAAAGAAA ATCTATTGTC ATTGAAATTA Tri'iXJ N LVi t: 1680 

CACAAGATCG CTCTATGAGT ATTCTTCCAG QGATTCTAAT ATTTATTTAA G 1731 



(2) INFORMATION FOR SEQ ID NO: 272:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1320 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272:

CTGCTTAGGA AGAGAAGGTC AGAGTTCGCG QGGGCAGAGG CATTCTTGCC GCTGGCCCAG 60 

TCACTATGTA GTGGAGGGGC AGACACOCTC CCGCAAATTC TQ6AAGGTTC TTACTCTCGA 120 

CTAGGGCAGT AGCCCAGGAC TCCTAGTCGC OQGCTTCAGG TCACTQCCGG CTQAACOGAG 180 

CTGCCGTCGC CATGrTTTOGC TGCTTGGTGG CQQG6AGGCT GGTQCAAACA GCTGCACAGC 240 

AAGTQGCAGA GGATAAATTT GTTTITGACT TACCTGATTA TGAAAGTATC AACCATGTTG 300 

TGGTTTTTAT GCTGGGAACA ATCCCATTTC CTGAGGQAAT GGGAGGATCT GTCTACTTTT 360 

CTTATCCTGA TTCAAATCX3A ATQCCAGTAT GGMAACTCCT A3GATTTGTC AOGAATGQGA 420 

AQCCAAGTCC CATCTTCAAA ATTTCAGGTC TTAAATCTGG AGAAGGAAGC CAACATCCTT 480 

TTGGAGCCAT GAATATTGTC CGAACTOCAT CPGTIGCTCA GATTQGAATT TCACTGGAAT 540 

TATTAGACAG TATGGCTCAG CAGACTCCTG TAGGTAATGC TQCTGTATCC TCAGTTGACT 600 

CATTCACTCA GTTCACACAA AAGATGrTTGG ACAATTTCTA CAATTTrGCT TCATCATTTC 660 

CTGTCTCTCA GGCCCAGATG ACACCAAGCC CATCTGAAAT GnTCATTCCG GCAAATGTGG 720 

TTCTGCAAAT GOTATGAGGC ATNTPCTGrTC TCCAATATTA AGGCTTTTTA TAACTGAATA 780 

TCTATTITCr CTATGAATAT ATTCCTTTTT TGACATTTAA ACATATTCTT TTATTCTGAA 840 

CATCAGCACT GCATGCCATT AAAGTATGTA CTATAGAGAT CTGATGAGAA ACAGTTCTTA 900 
CCXZTAAATAT TTTGTTATAT TGrOXXATT ATGAATTTAT AAAGACflGES^ AAATATAGTT 960

GCCTATGTTT TAGGGACCAC TATTAAAGCT TATAAATATT TCTGTATTTT CATTTAGAAG 1020 

TACCATCTAT GAGAGTAGTT TATACTC3CAC TGTGTACATG AATGC3CTAAT GAATCTATTT 1080

TOCAACTTTC COCSTCTTTrA TAGATATTTC TTTTCACTTT GAGTATCCTA GAQATGGGAG 1140 

GATGCXTAGG AASACTTTGT TGAGAAGTGG TACXrAraCTG TAGCATQGGA GAGCA3TCGG 1200 

AATGCACTAG GTTTGAATTT GGCATAAIGG TAGCTATGrrG ACCCTCAGCA AATTTCTCTC 1260 

ATCTGCTCAT CTGANGAATG AGGAAATAGG AGTGAATrTG AawmCCTA GGTCCMTCT A 1320 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 515 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

CCCTQGAGAG QGGCTGCTGT GCCAGCTTGG GGAGGGTCTG GGATGGGGCT QCCCCTGATC 60 

GCCCTGATGT QGAGTACCTT GCCAGCATCT GCTGGGGTGA AdTTATTTT AGCCdTCCC 120 

TTGITGYTCT TATGAAGAAC AGAGGAGGGG TQGGCAGGTC AGTGATGrPCA GCAGTGAGTA 180 

TTCCCAGCAC AGCGGCTCTG 6AAGAGQCAT GAGGCATTTC TTTCAGGAAA TC3WCATTAT 240 

TCAGCCAGAA GGCATTCATT AAGTAAGTCC TGACTTTGTG CCCAGCTCTG TGTTATAGGC 300 

CCTTGGCGAG ACTCAGGAGG GGCARAGGAC GCTAGKTTKT AGWEAACAOG GAACCTCARA 360 

GCaWATATGG TCCAAGAAGA CCCGGQQGCG GTGAAAACCC TGTOGACTAA TGCTCAOGGG 420 

AGCCCGAGGT CACACTTTGA CTTTGCTACC ATQGGCTGTG TCTANGNACG TATATATGCT 480

GOGTAATTAT TACAGAGGCA GTCCATGTGC ATTGT 515 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2995 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
TGACACOCAT AAGGAATTCA TGAAGAAAGT AGAAGAAAAG OQAGTGGAOG TTAAdCAGC 60 
AGTAGCXMX3 GGftGAACHCA TOCTGQCTGT CTGOCACCCC GATTGCftTCA CAACCA3CAA 120

ACACTGGATC ACCATCA3CC GAGCTCGCTT OSAQGAOGTC CTGACATOQG CTAAGCAGCA 180 

CCftGCAOXT CITCAAftCGG CCTOTTCAGA, ACTGGTQGCT AASGCTGAGC TCCTGGAAGA 240

ACTTCPGQCA TGGATCCAGT GGGCTGAGflC CACCCTCATT CAGCGGGATC AGGAGCCAAT 300 

CXXXXSGAAC ATTGACOGAG TTAAAGCCCT TATCGCTGAG CATCAGACAT TTATGGAGQA 360 

GATGACTCGC AAACAGOCTG AaJTQCSACCXS QGTCACCAAG ACATACAAAA GGAAAAACAT 420 

AGAGCCTACT CACGCGCCPT TCATAGAGAA ATCOCGCAGC GGAGQCAGGA AATCCCTAAG 480 

TCAGCCAACC CCTCCTCCCA TGCCAATCCT TTCACAGTCT GAAGCAAAAA ACCCACQGAT 540

CAACCAGCTT TCTC5CCXX3CT GQCAGCAQC3T GTQGCTGTTA GCACTQGAGC GQCAAAQGAA 600 

ACTGAATCAT GCCTTQGATC GGCTOGAGGA GrTTGAAAGAA TTTQCCAACT TTGACriTCA 660 

TGTCTGGAGG AAAAAC3TATA TQCCSrrTGGAT GAATCACAAA AAGrrCTCXaAG TGATQGATTT 720 

CITCOQQCX3C ATTGATAAGG ACCAGGATGG GAAGATAACA CGTCAGGAOT TTATCGATOG 780 

CATTTTAGCA TCCAAGTTCC CCA0CACX:AA GrTTAGAGATG ACTGCTGTGG CTCACATTTT 840 

OGACCGAGAT GGGGATGGTT ACATTGATTA TTATGAATTT GTGQCTGCTC TTCATCCCAA 900 

CAAGGATGCG TATCGACCAA CAACCGATGC AGATAAAATC GAAGATGAGG TTACAAGACA 960

ACTGGCTCAG TGCAAATGTC CAAAAAGGTT TCAGGTGGAG CAGATCGGAG AGAATAAATA 1020 

CXX3GTPCTTC CTCGGCAATC ACTTTGGGGA TTCTCAGCAG TTGOGGCTQG TCCGTATTCT 1080 

GOQCAACCOT GATQGTTCGC GTTGGTGGAG GATOGATGGC CTTGGAIGAA TTTTTAGnXSA 1140 

AAAATGATCX: CTGCCGAGCA OGAGGTAGAA CTAACATTGA ACTTAGAGAG AAATTCATCC 1200 

TACCAGAGGG AGCATCCCAG GGAATGACCC CX:TTCXX3CTC ACGGGGTCGA AGGTCCAAAC 1260 

CATCTTCCCG GGCAGCTTCC CCTACTOGTT CCAGCTOCAG TGCTAGTCAG AGTAACXACA 1320 

CXTGTACATC CATOCCATCT TCTCCAGCCA CCCXaCCCAG TQGAACCAAG GrTTATCCCAT 1380 

CASXaGCTAG CAAGTTQAAA CGACXIAACAC CAACTTTTCA TTCTAGTCGG ACATCCCTTG 1440 

CTQGTGATAC CAGCAATTAG TTCTTCCCCG GCCTCCACAG GTGCCAAAAC TAATCGGGCA 1500 

GACCCTAAAA AGTCTGCCAG TOGCCCTGGG AGTOSGGCTC GGAGTCGAGC aSGGAGTCGA 1560 

GCXMCAGCC GGCGAGGAAG TGACQCTTCT GACTTTGACC TCTTAGAGAC QCAITGCTrG 1620 

TTCCGACACT TCAGAAAGCA GOGCTGCAGG GGGCXIAAGGC AACTCCAGGA GAGGGCTAAA 1680 

CAAAOCTTCC AAAATCCCAA CCATGTCTAA GAAGACCACC ACTGCCTCCC CCAGGACTCC 1740 

AQGTCCCAAG CGATAACACT GTCTAAGCAC CCCCAAGCCA CTATCCACTT TCAATCCTQC 1800 

TOCATACATT GGGTCTATAT TTATTCTGAA CXXSGAGAAGT TATATTCn-A AAAGTCTAAA 1860 
AGAATAA3TC TOPTATCAAG CTCCCTTATT rmTlVi ' lT TreTAAGTTA CTAnTTCAT 1920
GTGAATATTT ATGTAGATAA AATTTC3CCTC CTGtSTAACCC TGrTAATOGAT GGQGCCCAGA 1980
AATGAAATAT TTGAGAAAAA CAAGTCAAAA GGTCAAGATA CAAATCTOTA TTAAAAAAAA 2040

AAAAGOCTAT TAATAGGCyTT TCTOCGCQCT GCAGCSGlTCr AAACCTOTT TATCmTAG 2100 

GA3TATTCCT AAATGCATCT TCrTTATAAA CPPtSACrTGC TATCTCAGCA AGATAAAaTTA 2160 

TA3TAAAAAA ATAAGAATCC TQCAGTGrTT AAGGAACTCT TTTTTTOTAA ATCACGGACA 2220 

CCTCAATTAG CAAGAACTGA GQQGAGGGCT TTTTCCATrc; TITAATOrTT TOrcATmT 2280 

AGCTAAAGAG AGGGAACCTC ATCTAAGTrAA CATTTOCACA TOGATACAGC AAAAGGACTT 2340 

CATTQCAATA CTGTCTTTGG ATATrGTlTC AGTACTOQGT GTTTAAAGGA CAAATAGCTC 2400 

CTAGAATTCA GGCSGTAAATG TAAGTCTPCA GAAAACGTCA GAACATrTGG GGTTTrAAAC 2460 

TGATTTCTTG CTCCCTATCC AGCCTAGACA CXAGTAACTC TTGTGTTCAC CAGGACXXAG 2520 

ACCCTTGGCA AGGGATAGGC TCGTTOGrrGA CATTGTGAAT TTCAGATITC TTTTNTCCAC 2580 

rrrmTGCT ATTTATTTAA ATGGTCGATC AACTTCCCAC AAACTGAGGA ATCAATTCCA 2640 

CGAGCCTGTT CTGAAAA3CT GGAOGTAAGA CAAACACGTG <nxX3TCCTrT AATOGACHTC 2700 

ACCAGCACAC TTOTTAACCA CnCCTGTTTG CTTrcGTCTT TTTTTGTCCG TAATAAASTC 2760 

AACTC3ACCAA GTGACCATGA AAAQGGGCTG TCTGQGGCTC C1X3TTTTTTA GCTOCTCTTC 2820 

TTCAGCTCCG ACCAICTTOC TCTGTCATTA TCTCAATrGG TTTTAATTGA QGCAGAAACT 2880 

GAAQCTCTAC CAATGAACTG TTTAGAAACA AGACACACTT TTGTATEAAA ATTOCTTX3CA 2940 

GTAACAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG AGGGGGQCCC GGTAC 2995 
(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1990 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
C3GGACCCGCG CGSCTCCCGG GGATQGTGAG CAAGGCOCTG CTGCNWCGTC TCTGCCGTCA 
ACCGCAGAGG ATGAAGCTGC TGCTGQGCAT OGCOTTQCTG GCCTACC3TCG CCTCTOmG 
GGGCAACTTC GTTAATATGA GGTCTATCCA GGAAAATGGT GAACTAAAAA TIGAAAGCAA 
GATTGAAGAG ATGGTTGAAC CACTAAGAGA GAAAATCAGA GATTTAGAAA AAAGCTTTAC 
CCAGAAATAC CCACCAGTAA AGTTTTTATC AGAAAAGGAT CGGAAAAGAA TTITCAWrAA 
CAGGAC5G0GC AGKGTTCGTG GQCTCCCATC TKAACTGACA AACTCATGAT GGACGGCCAC 360

GAGGTGACCG TGGTCGACAA TTTCTTCACG GGCAGGAAGA GAAAOGTGGA GCACTOGATC 420 

GGACATCAGA ACTTCGAGTr GATTAAOCAC GACGTCTQQG AGCCCCTCTA CATOGflGGTr 480 

GACCAGATAT ACCATCTGGC ATTCTCCAGCC TCCCCTCCAA ACTACATCTA TAATCCTATC 540 

AAGACATTAA AGACCAATAC GftTTOGGACA TrAAACATGT TOQQGCTGGC AAAACGftGTC 600 

GCTTGCOCXnC TQCTCCTOGC CTCCACATCG GAGGTOTATG GfiGAITCCTGA AGTOCACXXT 660 

CAAAGTGAGG ATTACTOQOG CCAOSTGAAT OCAATAQGAC CTOGGGCCTG CTACGATCAA 720 

GGCAAAOGTC TTGCAGAGAC CATGTOCTAT GCCTACATGA AGCAGGAAGG OGTQGAAGTC 780 

CGACTPCSQCCA GAATCTTCAA CACCTTTOGG CCAOXAIXX: ACATGAACGA TQGGCGAGTA 840 

CTTCAGCAACT TCATCCTCCA GGCGCTCCAG QQGGAGCXZAC TCACXXTrATA OGGATCOQGG 900 

TCTCAGACAA GGGCGTTCCA GTACGTCAGC GATCTAGTGA ATGGCCTCGT GGCTCTCATC 960 

AACAGCAACG TCAGCAGCCC GGTCAAOCTG GQGAACCXZAG AAGAACACAC AATCCTAGAA 1020 

TTTGCTCAGT TAATTAAAAA CCTrorrOGT AGCGGAAGTG AAATTCAGTT TCTCTCOGAA 1080 

GCCXZAGGATG ACCCACAGAA AAGAAAACCA GACATCAAAA AAGCAAAGCT GATC3CTGC3GG 1140 

TGGGAGCCCG TGGTCCaxrP GGAGGAAGC3T TTAAACAAAG CAATTCACTA CTTCCGTAAA 1200 

GAACTCGftGT ACCAGGCAAA TAATCAGTAC ATOXCAAAC CAAAGCXTTQC CAGAATAAAG 1260 

AAAGGAOSGA CTCGCCACAG CTGAACTCCT CACITTTAGG ACACAAGACT ACCATTCTAC 1320 

ACTTGftlXSGG ATGrEATTTTT GGCnTTTTT ' I CTTUiXJim' lAAAGAAAGA CTTTAACAGG 1380 

TGTCATQAAG AACAAACTGG AATTTCATTC TGAflGCTTGC TTTAATGAAA TQGATGnOCC 1440 

TAAAAGCTCC CCTCAAAAAA CTGCAGATTT TC3CCTTQCAC TTTTTGAATC TCTCTTTTTA 1500 

TGTAAAATAG CGTAGATGCA TCTCTQCGTA TTTTCAAGTT TTTITATCTr GCTGTGAGAG 1560 

CATATGTTCT GACTGTCGTT GftCAGTTTTA TrTACTGGTT TCTrTGTGAA GCTGAAAAQG 1620 

AACATTAAGC GGGACAAAAA ATCCOGATTT TAOTTATAAA AGTGGGTACT TAATAAATGA 1680 

GTCGTTATAC TATGCATAAA GAAAAAYCCT AGCAGTATTG TCAGGrTOGTC GTGCGCCQGC 1740 

ATTGATTTTA GQGCAGATAA AAGAATTCTG TGTGAGAGCT TTATOTTTCT CTTTTAATTC 1800 

AGACTTTTTTC CAAGGTCTAC TTTTGAGnTG CAAACTTGAC TTTGAAATAT TCCTGTTGGT 1860 

CATGATCAAG GATATTTGAA ATCACTACTG TCTTTTGCTG CGTATCTGGG GCGQGGGCAG 1920 

GTTGGGGGGC ACAAAC3TTAA CATATTCTTG GrTAACCATG GTTAAATATG CTATTTTAAT 1980 

AAAATATPGA 1990
(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:

AACTTCGCTT AGCTCTPOCAG GGrTNAAAOGG GTGAGNCCTT AAAAACAGAA GAGAACAAGA 60 

TTTAAAGTCC GTrGCATTCA AAATAACAAA CAATATCAAT GTTTTAATCA AGGATCTCTT 120

CCACATTCCT CCTTCTTATA AGAGCACAGT AACACTATCC TGGAAACCTC TACAAAAGGT 180 

TCSAGATTGGG CAAAAGAGAG CCACTTGAAGA TACAACTTCA GGTTCACCAC CCAAGAAATC 240 

TTCAGCAGGA CCAAAAAGAG ATGCCAGGCA GATTTATAAC CCTCCCAGTC GGAAATATAG 300 

CAGCAATTTG GGCAACTTTA ATTATGAGCA GAGAGGAGCC TTCAGGGGAA GTAGAGGTOG 360 

CCGAGGTTQG GGCACAOGAG GAAATCGTAG TOSGGGAACA CTCTACTGAA TAAGACATCA 420

GCATTCTTCA GCATTGTCAT GAGCTTAATA TACTTAAATT CTACTACTCA TTCGATTQCC 480 

GGGGATGTCC CTTTAAACAG ACTGCTGCCT TCAGCTAAAA ACTTAATGTT CTTTATACCT 540 

TTGTATGrEAT GACCTACTTT TGTAACAGAC CATQGTTGTG TCCAAGGTAA AACCACAGTC 600 

ATATTTTTGG ATGCTTTGTC TGCAATCTTG ACTTGnTTT GCAGTATCAT TATICAGACT 660 

TCAAATTGTG AATCTTTTAA ACATCTTGAT AATTTGTTGT TGAGAGCTGT TCATTCTAAA 720

ATGTAATGAA ATTCAGTCTA GTTCTGCTGA TAAAGATCAT CAGrTPTTGAA AGGrETACIGA 780 

TTTTCCTCTT CCCTCTTAGT TTTTTAOCCA ATATATGGAG AAGAGTAATG GTCAATCTTA 840 

ACATTTTGTT TTAATTCTTTT AATAAAGCTG CTGGGCAGTC GTOCAGCATT CCTACCPAGT 900 

GTCATAAAAG CAAAATACTT ACATAGCTTT CTTAAAATAT AGGAATGACA TTACATTTTT 960 

AGQAGAAAGT AAGTTGCITr GCAOCGCCTA CTTAATTCTT TTCCATATAT TGTGATACAA 1020

ACTTTTGAAT ATQGAATCTT ACTATTTGAA TAGAAATGTG TATGTATAAT ATACATACAT 1080 

ACATAAGCAT ATATGTGTGT GrrGTGTGrTGT ATATATATAT ATATGCATGC TCTGAAACTT 1140 

GACTACACAA CATAAATCAC TTTTTAAATT CCAGGAACGG GTAGTCTQAC ACGC5TGATTA 1200

TCCTTTTGAG GCTGAATCCG TTATTAACTT GrTTATTTAGG TTTTTACTCC CAGTAGCAAG 1260 

GGATTCTAAG TTAGTTGCAC TTACATGATT ATTGrTTATTT AAAACTAAGA ATAAAGGCTG 1320

CATTTTCAAA GATAAATTGG AATTGCTGTT GGTGAAATAA CAACCAAAAT ACTCAATCTC 1380 

ATC3TACATAC AGGTITCTAC AGGAAGAGAT GGTATAATTT ACAATTTGGA GATTTAATAA 1440 
CCAGQGCTAC CCAGAAAAAG TGACTTGATA ACATGGTACC AATAAGTAAG GGATOCTCTC 1500

TaXSTTTGCT TTTGCCACTT TCAAGATTTT AACTTCTCAG GTTATTAATC AAAATTATTG 1560 

TATAAGTTAG CXIAATAGAAT TTTTAGGTTA AAACAACAGA TGGGGQGTTT GTGGftGTCTT 1620

TAATGTPCATXS GGCATTETTA GTAGCAI»GA CCCTTPGTTC TCCATTTGAA TGrTTTCGTAT 1680 

ATTTTTGrrrr CACAGTTAAT CTTCOCTCCC CAAGTTTQCT AITCAAATCA ACTGCCTGAA 1740 

TGACATTTCT AGTAGTCTGA TOTATTTTTC TCAGGAATAG TTTGTGATTC CAATOCAGGT 1800 

GTCTTCATTA CCATTACCTC TACACTQCAG AAGAAGCAAA ACTCCTTTAT TAGAATTACT 1860 

GCACATGriCT ATOGGGAAAA TAGTTCTGAA AGGCTAGAAT GATACAAGTTG AGCAAAAGTT 1920

GGTCAGCTTG GCTATGGAGT GGrPQGCAATA ATCTCTAAAC ATTCCAAAAG ACCATGAGCT 1980 

GAACCTAAAC TCCCTTGCSAA TCTGAACAAA GGAATATAAA ATTC3CCATTT GAAAACTC3AC 2040 

CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCCATTT CAAGTACAGA 2100 

AATTATAGAG ACTACAGCTA AATAAATTTG AACATTAAAT ATAATTTTAC CACTITITGT 2160 



CTTTATAAOC ATATTTCTAA ACTCAGAACT GAGCAGAAGT GACTTTACTT TCTCAAGTTT 2220

GATACTGAGT TCACTGTTCC CTTATCCCTC ACCCTTCCCC TTCCCTTTCC TAAGGCAATA 2280 

GTGCACAACT TAGGTTATTT TTGCTTCCGA ATTTQAATGA AAAACTTAAT GCCATGGATT 2340 

TTTTTCTrrr GCAAGACACC TGTTTATCAT CTTGTTTAAA TGTAAATGTC CCCTTATOCT 2400 

TTTGAAATAA ATTTCCTTTT GTAATTTTAA AAAAAA 2436 




(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 782 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:
GCCACTGACT TCTOCCACCC TTCTGTCTCC CCCATAATAG TTTATTTGCJr TGGTCTGGAC 60 
TCACTTGTGG CCTTTRATTA AA3TCCTAAG GGGCCTGAAG AAGACATTTC TACTGCAGAG 120
GGTTAGAGGC ACTTGAGCAA GGCCCCCACA TCCCAACTCT GGGAGTTGTC GTOGGAGGAG 180 
GCACTTCTGG GGGATAGGAC CAGACAAGAT AACAGGAGCT CACATQC3NAA GCAGAAQCTG 240 
TGACAAGrrrr AGTAGTOCCA AAATGGGTTTA TATCCCTPCC CCCTTTACAT CAGAATCTTG 300 
TGAAATGGGA AAACAACAGA AGGAGGGGAT CAAAGATAGC TGATCTCACA TGCTTCCCAG 360 



GCAGGGCARA GGTQGGAGTC AAACCCGGGT GACAGGTQGG TGGAGAGCCC TGTTTGAGGT 420



TGrPGGCTGAT CCCTCTCTGG TATTAGTTTT TCCCCTGGGA GCAGGAAGCC CTAQGAflGfiG 
GGGACTGCAG GGTCCCCRGG GGATCTTTCX: TCCCTCCCCT GCATGAGGCA GAGGCAAGCT 
GCCTGOCAAC CCCCTCOCTC AAGGAATGGC CTTGCCCAGG AATGCCCACC ACACATACCC 

■ Ai r rivrrn ' r ttctagtcaa ACTCTTGnr attccttogc ttgcxtocct ccrrccrccc 

CTCTCAACCT TTACTTCTGA TTTCTATTTC ATOGAATTTG GGATTGAAGT TAAACTACAA 
CAGTGCX30CC AACACCAAGT CTTGCAOGAA AAAAATACAA AGAAATTTAA CAAAAAAAAA 
AA 

(2) INFORMATION FOR SEQ ID NO: 278:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 961 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:
GAGTTCOQGC TGGAGACCCG TGCTCTGQGC 0GC3CGCCTTC ACCATGGCCT CGGCAGAGCrr 
GGACTACACC ATCGftGftTCC CGGATCAGCC CTGCTOGAGC CAGAAGAACA GCCOCAGCCC 
AGGTGGGAAG GAGGCAGAAA CTCGGCAGCC TGTGGTGATT CTYTTQGGCT GGGGrrOGCTG 
CAAGGACAAG AACCTTGCCA AGTACAGTGC CATCTACCAC AAAAGGGGCT GCATOGTAAT 
CCGATACACA G0CCCGTC5GC ACATC3GTCTT CTPCTCOGAG TCAdCGGTA TCOCTTCACT 
TCGTGTTTTG GCCCAGAAGC TGCTCGAGCT QCTCTTTGAT TATGAGATTG AGAAQGAGCC 
CCTGCPCTTC CATGTCTTCA GCAACQGTGG CGTCATGCTG TACCQCTAOG TGCTGGAGCT 
CCTGCAGACC CCTOGCTTCT GCCQCCTtSCG TGTQGTGGGC ACCATCTTTG ACAGCGCTCC 
TGGTTGACAGC AACCrOGTAG GQGCTCTQCG GGCCCTGGCA GCCATCCTQG AGCGCCGGGC 
CGCCATGCTG CGCdCTTGC TGCTQGTQGC CTTTGCCCTG GTrGGTCGTCC TGTTCCACGT 
CCTGCTTGCT CCCATCACAG CCCTCTTCCA CACCCACTTC TATGACAGGC TACAGGACGC 
GGGCTCTCGC TQGCCCGAGC TCTACCTCTA VTCGAGGGCT GACGAAGTAG TCCTGGCCAG 
AGACATAGAA CGCATGGTGG AGGCACGCCT GGCACGCCGG GTCCTGGCGC GrrTCTGTGGA 
TTTCGTGTCA TCTGCACACG TCAGCCACCT CCCTTGACTAC CCTACTTACT ACACAAGCCT 
CTG?rGTCGAC TTCATGCGCA ACTQCGTCCG CTGCTGAGGC CATTGCTCCA TCTCAMCTCT 
GCTCCAGAAA TAAATGOCTG ACAMCTCCCC ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG 
(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1228 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

CGCGCTTTGC AGTTOGGTCT CCTOGTGTAC GC3CCAACGCC AAGTAGGGGA rrGOgTTCCt: 
TCCAGTOCXA GCCCTATCAG ATTTGGATAT GTCCTTCATA TTPGATTGGA TTTACAGTGG 
TTTCAGCAGT GTGCTACAGT TTTTAGGATT ATATAAGAAA ACTQGTAAAC TGGTATTTCT
TGGATTGGAT AATGCAGCSAA AAACAACATT GCTACACATG CTAAAAGATG ACAGACTTGG 
ACAACATC3TC CCAACATTAC ATCCCACTTC CGAAGAACTG ACCATTGCTG QCATGAOGTT 

TACAACTTTT GATCTGGGTG GACATGTTCA AGCTCGAAGA GTGTGGAAAA ACTACCTTCC 
TCXTTATCAAT QQCATTCTAT TTCTGGTGGA TTC5TGCAGAC CACGAAAGGC TGTTAGAGTC 
AAAAGAAGAA CTTGATTCAC TAATGACAGA TGAAACCATT GCTAATGTGC CTATACTGAT 
TCTTGGGAAT AAGATCGACA GACCTGAAGC CATCAGTGAA GAGAGGITGC GAGAGATGTT 
TGGTTTATAT GGTCAGACAA CAGGAAAGGG GAGTATATCT CTGAAAGAAC TGAATGCCCG 
ACCCTTAGAA GmTCATGT GTAjGTGTGCT CAAAAGACAA GGTTACGGAG AAGGCTTCCG 
CTGGATGGCA CAGTACATTC ATTAACACAA ACTCACA3TG GTTCCAGGTC TCAACGTrCA 
GGCTTACTCA GAGATTTGAT TGCTCAACAT GCATAACTTG AATTCAATAG ACTTTTGCTG 
GTTATAAAAC AGATGTTTTT TAGATTATTA ATATTAAATC AACTTAATTT GAATGAGAAT 
TGAAAACTGA TTCAAGTAAG TTTGAGTATC ACAATGTTAG CTTTCTAATT CCATAAAAGT 
ACTTGGmT TACAGTTTAT AATCTGACAT CACCCCAGCG CGATTTGTAA AGAGCAACTT 
TCCAGCAGTA CATTTGAAGC ACTTTTTAAC AACATGAAAC TATAAACCAT ATTTAAAAGC 
TCATCATGTT AAATTTTTTA TGTACTTTTC TGGAACTAGT TTTTAAATTT TAGATTATAT 
GTCCACCTAT CRTAAGTGTA CAGTTAATAA TTAGCTTATT CAATGATTGC ATGATGCCTT 
ACAGTTTTCA ATAACTTTTT TTCITATGCA AAOGTCATGC AATAAAACAA ACTCTAATGT 
TTGGCAAAAA AAAAAAAAAA AAANTCGA 
(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1327 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

TCTCGGGrrCT CX3GGACAGGT GfiGCACCCTG ATGAAGGCCA CGC3TCCTCAT GCGGCAOCTG 60 

GGOGQGTGCA GGAGATOGTG GGOGCCCTCC GCAAGQGCGS CGGAGACCGG TTACAGGTGA 120 

TTTCTGATTT TEWCATGACC TTGAGCAGGT TTGCATATAA TCGAAAGCGA TOXCTTCTr 180 

CTTACAfiTAT TCTOC3ATAAT AGCAAGATCA TCAGTGAGGA C3TCTCGGAAA GAGCTCftCAG 240 

CGCTCCTTCA CCACTATTAC CCAATTCAGA TCGACXXACA CCGGACCGTC AAGGAGAAGC 300 

TACXrPCATAT GGTGGAATGG TGGACCAAAG CX»VCAATCT OCTATGTCAG CAGAAGATTC 360 

AGAACTITCA GATAGCCCAG GTOGTTAGAG AGTCCAATOC AATQCTCAGG GAGQGATATA 420 

AGACCTTCTT CAACACACTC TAOCATAACA ACATTCCCCT TITCATCTTT TCTGCGGQCA 480 

TTQCTGATAT CXTQGAAGAA ATTATCCGAC AGATGAAACT GTTCCACCCC AACAOCCACA 540 

TOGTGTCTAA CTACATGGAT TTTAA3X3AAG ATQGTTTTCT CXMGGATTT AAGGGCXAGC 600 

TGATACACAC ATACAAC3Utf; AACAGCTCTG TGTGrPGAGAA CTCTOCTTAC TTCCAC3CAAC 660 

TTGAGGGCAA AACCAATGTC ATCCTQCTGG GfiG?VCTCTAT CXSGGGACXTTC ACCATCGCCG 720 

ATOGGGTTCC TGCrTCTGCAG AACATTCTCA AAATTGGCTT CXTTGAATGAC AAGGTOGAGG 780 

AGCGGCGGGA NCGCTAACAT GGACTCCTAT GACATCGTGC TGGAGAAGGA CGAfiACTCTG 840 

GATGTC5GTCA ACGGGCTACT GCAGCACATC CTGTOCCNAG GGGGTCCAGC TOGAGATQCA 900 

AGGCCCCTGA AGQCGCAGGC TCCNAAGKCX: SCTGCAGGCC GTGCJTGAGGA GGC3QCX3CCTC 960 

CCCAGAGTCT GCTCCCXXX3T GAACACAGAG CAGAGCCAC3G GrTGQCCAGCA GrOQCTOGGT 1020 

CCTTCCGCGC CCCTCCGTCC TCCTTTCCCT GAGCACCTTC ATCACCAGAG GCTTGAAGGA 1080 

ACCCOQCCAT GTGGCftGGGC ACAGGCACTG TTCCTGCTGA ACCTTCGAOC ACAGCATOrC 1140 

AGTGCTCTAG GGATTGTCTA CTCCAGGGAT TITCTTCAAA ATTTTTAAAC ATGOGAAGTT 1200 

CAAACAAATA TAATGTGTGA AACAGATCAA AATTTTTAAA ATGAAAAAAA AGCTQCTCTG 1260 

ATTCAGQQGA TOTOGCTTCGG GCTAGAACCT GGACCTCTTG GCCTQGGGGC ACATGGGATG 1320 

CTTCTAG 1327
(2) INFORMATION FOR SEQ ID NO: 281:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:
TCACOCTOCC TACAGCGTGG AGCTCAGATO ACTGCGCCCT CCAOGGTrCAC TGTGAGCAGG 
TQOTATTCAC AGCCTQCATG ACCCTCACGG CCAGCCCTGG GGTGTTCCOC GTCACTCjrGT 
GGCTTTGGCT GAAGCCTAAT TCCACAOCTC CTlVlTi'l'lT GAGAGAGACT GAGAGAACCA 
TAATCCTTGC CTGCTGAACC CAGOCTGQGC CTQGATQCTC TGrPGAATACA TTATCTTGCG 
ATGTTGGGTT ATTCCAGCCA AAGACA3TTC AAGTGCCTGT AACTGATTTG TACATATTTA 
TAAAAATCTA TTCAGAAATT GGTCCAATAA TGCACGTGCT TTGOCCTGGG TACAGCCAGA 
GCCCTTCAAC CCCACCTTGG ACTTGAGGAC CTACCTGATG GGAOGTTTCC ACGTGTCTCT 
AGAGAAGGAT TCCTQGATCT AGCTGGTCAC GACGATGrTTT TCACCAAGGT CACAGGAGCA 
TTGCGTCGCT GATGGGGTTG AAGTTTGGTT TQGTTCTTGT TTCAGCCCAA TATGTAGAGA 
ACATTTCAAA CAGTCTGCAC CTTTGATACG GTATTGCATT TCCAAAGCCA CCAATCCATT 
TTGTG6ATTT TATGTGTCTG TGGCTTAATA ATCATAGTAA CAACAATAAT ACCTTTTTCT 
CCATTTTGCT TGCAGGAAAC ATACCTTAAG TTTTTTTTGT TTTGTTTTTG TTTTTTTGIT 
TTTTGrmTC CTTTATGAAG AAAAAATAAA ATAGTCACAT TTTTAATACY AAAAAATGGA 
CAAAAAAAGT CGAOGOGGG 

(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2196 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:
AAAGACTCTA ACATCCATQA GCTTGAACAT GAGCAAGAGC CTACTTGTGC CKSCCAGATG 
GCTGAGCCCT TCCGTACCTT CCGAGATGGA TQGGTCTCCT ACTACAACCA GCCTGTGTTT 
CTGGCTGGCA TGGGrrCTTGC TTTCCTTTAT ATGACTGTCC TGGGCTTTGA CTGCATCACC 
ACAGGGTACG CCTACACTCA GGGACTGAGT GGTTCCATCC TCAGrTATTTT GATGGGAGCA 
TCAGCTATAA CTQGAATAAT GGGAACTGTA GCTTTTACTT GGCTACGTCG AAAATGTQGT 
TTOGITCGGA CAGGTrCTGAT CTCAGGATTG GCACAGCTTT CXTOTTTGAT CTTOltJ I V m 360
ATCTCTGrrAT TCATGCCTGG AAGOCXXXTTG GACTTGrrccG TrrcTccnT tgaagaiatc 420
OGATCAAGGT TCATTCAAGG AGAGICAATT ACAOCTACCA AGATACCTGA AATTACAACT 480
GAAATATACA TGTCTAATQG GTCTAATTCT GCTAATATTG TCCX30GAfiAC AAGTCCTGAA 540
TCTGTGCCCA TAATCTCTGT CACTCTCCTG TTTQCAGGCG TCATTGCTGC TAGAAT0C5GT 600
CTTTGGTCCT TTSATTTAAC TGTOACACAG TTOCTGCAAG AAAATGTAAT TGAATCTGAA 660
AGAGQCATTA TAAATGGTOT ACAGAACTCC ATCAACTATC TTCTTCATCT TCTQCAITTC 720
ATCATGC?rCA TCCTOGCTCC AAATCCTGAA GCnTTGGCT TGCTCCTATT GATTTCAGTC 780
TCCnTCTGG CAATOGGCCA CATTATGTAT TTCCGAnTG CCCAAAATAC TCTQC5GAAAC 840
AAGCTCrrTG CTTOCGGTCX: TGATQCAAAA GAAGTTAOGA AOGAAAATCA AOCAAATACA 900
TCTOITCrTT GAGACAGTTT AACTGnTGCT ATCCTC5TTAC TAGATTATAT AGAGCACATG 960
TGCTTATTTT GTACTGCAGA ATTCCAATAA ATGGCTGGGrr GTTTTGCTCT GITTTTACCA 1020
CAGCTGTOCC TTGAGAACTA AAAGCTGrTTT AOGAAACCTA AGTCAGCAGA AATTAACTCA 1080
TTAATTTCCC TTATGrTTGAG GCATQGAAAA AAAATTOGAA AAGAAAAACT GAGTTTAAAT 1140
AOGGAGACTA TAATGATAAC ACTGAATTCC CCTATTTCTC ATGAGTAGAT ACAATCTTAC 1200
GTAAAAGAGT GGTTAGICAC GTGAATTCAG TTATCAnTG ACAGATTCTT ATCTGTACTA 1260
GAAITCAGAT ATGTCAGTTT TCTGCAAAAC TCACTCTrcfT TCAAGACTAG CTAAriTATT 1320
TTTTTGCATC TTAGrTTATTT TTAAAAACAA ATTCrrCAAG TATGAAGACT AAATTTTGAT 1380
AACTAATATT ATCCTTATTG ATCCTATTGA TCTTAAGGTA TTTACATGTA TGrPCGAAAAA 1440
CAAAACACTT AACTAGAATT CTCTAATAAG CTTTATOGTT TAGCTTAAAG AGCACCTTTG 1500
TATTTTTA'rr ATCAGATGGG GCAACATATT GTATGAAGCA TATCTAGCAC TPCACAGCAT 1560
GGnTATCATG TAAGCTOCAG GTAGAAQCAA AGCTGTAAAG TAGATTTATC ACACAATGAC 1620
TQCATACAGA CTTCAAATAT GTCAATAGTT TGGTCATAGA ACCTAGAAGC CAAAAGCCAC 1680
ACAGAAGQGC AAGAATOCCA ATTTAACTCA TGnTATCATC ATTAGTGATC TGTGTTGTAG 1740
AACATGAGGG TGTAAGCCTT CAGCCTGGCA AGTTACATGT AGAAAGCCCA CACTrGTCAA 1800
GGrrTTTGrrTT tacaaatcac ttgatttaac ACACTCAGGrr AGAATATTTT TAnTTTACT 1860
GTTTTATACC CAGAAGTTAT TTCTACATTG TTCTACAGCA AGAATATPCA TAAAAGTATC 1920
CCTTTCAAAT aXTITCAGA AGAATAGAAG AAAAAAAGTT TGTATATATT TTAAAAAATT 1980
GTTTTAAAAG TCAGrTTTGCA ACATGrCTCT ACCAAGATGG TACTITGCCT TAACCGTTTA 2040
TATGCACTTT CATGGAGACT GCAATAOGrTT GCTATGAGCA CTTTCnTAT CXTrTGGAGTT 2100
TAATCCITPG CrrCATCTTT CTACAGTATC ACATAATGAT TTOCrAICTr CTTAAAATCTT 2160



TCTAAAAAAT TTCTATATAA AATATTTGAA ACTOAA 
(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1185 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

QCAGTTAAGG CITCTGATAA GGAAAGAGAG TCTGAACAGA GCACACACAT CTOGAGCTCC 60 

AGGAGTC5GGG GATC3CAGCAT CAGATTCCAT CTTGAATTTC TGCTAAAATA CTTTOrACTC 120 

ATAATCX5ATC TCAACAAAGA TCTOTATrTC ATCTGTQC3CT CCATCTTCCC TCTOGCTTCAA 180 

GTAGATGITA AOCTQGACCT TC3GCACGCCT CTTAACATGA AGAGATCTAG CTAGACAGAC 240 

AGACTCCCCC ATTTATOGGA ACAAGAATTC AATTTATTCT CTATITATAA AACATXTriT 300 

TAAAGTQCCT TQGGTATAAA AATCTAAATG TCTGCGGTGT GATCAGTCAG GAGCAOyrAA 360 

CTATCACTCT TCGCATCCTT TGGTCACTGG GAGATCCTTr GGGGGCTGGG AGGTCCITCT 420 

GTCCCAGGCT AAAGGAAAAG CTTCACAAGG GTAAGAGCCA CAGAACCCTC GGCAA6AAAG 480 

GCCGGTCAGG GAGAATGAAT QCTACAGAGA GGAAAGGAAG GAAAGGGGGT GGAACAGAGG 540 

TAGAAGGCAA GGAAGGGATG CCGCACTC3GA GACOGATGGG GACACTCTAA OTCTCCAAGA 600 

C3t3GAQGATCT TCCTTCTTeA ATQCTGAACA CAGCTAGTCT GAACCTTCCT TCGAAAC?PCC 660 

AGCTGTTTGC CCATC3CATAG GGCCAACTCT CCCTGCAAAG CAGCAAATC3T GGCTTCTATC 720 

AQGAAGGAAA AGTATCCATC AGTCTTGACAA GAGGTCACCT TCGAACTTOC ATCAACTCCT 780 

TGCXXAGCCh CAAAGAGTCC TQGTAGAAGT GAGGATCGCC TAGTCTTACG GCTOTCOCyrT 840 

TATAGAAGTA GCAGTACAAC ACTGCTQCTA GTCTCTGGAA TACAAACAGC ATTTCAAGTC 900 

CATCTGTCCA TATGAAGCTG TTGGAGnTTT TCCAGCGTAA GTTCATGACC CAGACATCAA 960 

GGGAGATGCT QAGGGCAAAG TACACAGCTG TCAGGATGAT GGTCCCTTTG AACTTATOGA 1020 

ATAGGAGGTT GACCAGGCCA GCCTGGAAGA CGAAGGTGTT GAAGAACATC AGGAAAATOA 1080 

TGATQATGTT GAAGAGGACT GCAATATCCT GGATGCACTG AGGGAGAGGY TTCTACnTOC 1140 

TTTGAATGAG AGCTGTITCC CTTGCTCTAA GGCAAGCACC TCCAA 1185 
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1634 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

AQGGAAAGGG GAGGGTAGCX5 GGAQQGTAGC AGGTGAGTTC CTAGC3QCTGG AAGGmTAGC 60 

AGCAGCCTOG TGCAGTQCCC TGTCATCAAG ACAAACOCAC GC?1X3C1'MCTG GCnGCCTACC 120 

AAGCTrQGTT TGTACAAAAG CAAGGTQGGA CSTCTATTTTT CTACATGAGA TACATCACAC 180

TTACCTGTGG GCCAGTATPG TGAAGTGAGT CTGAGTTGTT TACACTGATG CXTITCCCTGC 240 

CCACCACAAA TTGTCTACAT AGTCTTCAGA TGATACCACC CCTTTCCCCA GCTCCCAACC 300 

AAGAGCTC3GT TCTAGGCXTTG TGTTATATGT CATATTTAGC STTTTTATAT ATGACCTTTG 360 

ATTTCTGTTG TTTGTATTTT AGCACAGTCT ATGCACCTTC ATTTAAATAC ATCTGTGTGC 420 

ATACAGATAC GCATATATGT GTC3TGCGTAT GCATATATCT CTCATCTGTA GTTTCCAAGA 480

GTTCAGCTGA AGCAGATC3GA GTCCTGCAGC CCAGGAGACA CXXTTOCATCX: CTGCTAATAG 540 

TGTTTGOCAC AAGTATTAGT GAGTCTTCCT TATTAATATT TTCATTTCAG AAGACTGAAG 600 

CAAAGCTGAT AGrrGTTTC3CT GTrTCTTTGG CAGCTAAGTG AGGGTCTTQG GATGACTTGC 660 

TGTGTTCCTC AAGCTGCACT TTQQGGCXM' CTCTQCAGTA TTAGCCCCCT TTTTQCTTGG 720 

TGGTACTCTG TCTGTGCXnx; TGTGTGTGTG TGATAGTCAC TCTTGCATGG CTTCCATGTC 780

TOGnrcrGG catttggqga taaqgtgctg aagcx:agagc atttgcactt TGrrrrGAGGC 840 

CTOGTTGCCA ATGATAGATC ACTCCTGrTTG ACXTPCSGTATG TCTGCTTGCT TGCIGCTTTT 900 

COTTGCTTTC TCTTGGAAGA GGAAAGGACT CTGGTCAGGC CCAGGCTGAG TGAGATGAGC 960 

TQCAGCTGGC TCATOGCXTTT CTTAGAGCAG AGAGAGGAGT ATGTCATTTT ACTAAGTTCC 1020 

TAAACAAACA TTTATQCAGG CAACACTCCT TGCAGATCXilA GAAACTGAGG CACAATAOGG 1080

TTATGACTTG CTCAAGAATA TGTAGCTGCT AGGGGC3TAAA TCAAGGCATC ACAATTTCTG 1140 

TTCAGCGGGC AGGAATAGGC TGTGAATTGC TAGCAdTTT TTTrTTTAAG CAATTACTTT 1200 

TTGACTTGTT CCTCTGAAAG TGCAAGAGGC GTACACCTTT CCCAAATGTA GACTAGAATC 1260 

TQCAGGATGC CACCCACTGT ATAGrTTCTGC TTTCCCAGAG AGGAAGAACT TTTAGAAACC 1320 

AAATGATCTT AATTGTTATT GCCCACCCXTT QGCTTTTCCG QGTAGAAAAT TCACAGTAGG 1380

AATGATTGTT AAGAGAGAGT GCTTGGAACC ATGGGTTAAC AGGAAAGGCT ACCTAACTTC 1440 

ACATATCTGC AACCAGAGCA GCXIACCAAGC ATTACTEAGC AGCAGGAAAA TGATTGTATT 1500 

TGAGTTCC7PG TGrTGTCCAAA ACrC3AGCX:AC CATGrTTCTTT GAAAftCATGC CACCTCAAGG 1560
CTGGGCGOGG TGC3CTCACAC CTGTAATCCC AGCAYTTTQG GGAGQCCSAG GGCGGGQCGG 1620
KTTCACCGGG GGTC



(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1795 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

TTCCCCOCAG GITOGCTTCC TTCGATTCCT TTTCTTQGTA TCAAOGTITG ATTGGAAGAA 60 

CAACCCCCTC TTTGTCAACC TCAATAATGA GCTCACTGTG GAGGAGCAGC TCGGGCACAG 120 

CTCMCCGTyA TQGTCATTC3T TACOCCCCAA GAC0C5CAAAA ACTCTGTOTG GACACAGGAT 180 

GC5ACOCTCAG CCCAGATCCT GCAGCAGCTT GTGGTCCTGG CAGCTGAAGC CCTQCCCATG 240 

TTAGAGAAGC AGCTCATQGA TCCCCGGGGA CCTGGGGACA TCAGGACAGT GTTCCGGCCG 300 

CCCTTQGACA TTTACGACGT GCTGATTCGC CTGTYTCCTC GCCATATCOC GCGGCAOCGC 360 

AGGCTTGTQG ACTCGCCAGY TGCCTCCTTC TGCOGGGGCC TGCTCAGCCA GCCGGQQCCC 420 

TCATCCCTGA TGCCCGTC3CT GGGTOATGAT CCTWCTCAGC TCTATCTGAC GCAGCTCAGG 480 

GAGGCCTTTG GGGATCTQGC CCTTTTCTTC TATGACCAGC ArGGTQGAGA GGTGATTQGT 540 

GTCCTCTQGA AGCCCACCAG CTTOCAGCCG CAGCCCTTCA AQGCCTCCAG CACAAAGGGG 600 

CGCATCSGTGA TGriCTCGAGG TGGGGAGCTA GTAATGGTGC CCAATGrTTGA AGCAATCCTG 660 

GAGGACTTTG CTGTQCTGGG TGAAGGCCTG GTGCAGACTC TGGAOGCCCG AAGTGAGAGG 720 

TQGACTGTGT GATCCCAGCT CTGGAGCAAG CTGTAGACGG ACAGCAGGAC ArTGGACCTC 780 

TAGAGCAAGA TGTCAGTAGG ATGACCTCCA CCCTCCTTOG ACATGAATCC TCCATQGAGG 840 

GCCTGCTC3GC TGAACATGCT GAATCATCTC CAACAAAACC CAGCCCCAAC TTTCTCTCTG 900 

ATC5CTCCAGC ATTGQGGCAG GGGCATQGrTG GCCCATGTAG TCTCCTGGOC CTCACCATCC 960 

CAGAAGAGGA GTGGGAGCCA GCTCAGAGAA QGAACTGAAC CCAGGAGATC CATCCACCTA 1020 

TTAGCCCTGG GCCTGGACCT CCCTGCGATT TCCCACTCCT TTCTTAGTCT TCTTCCAGAA 1080 

ACAGAGAAGG GGATGTGTOC CTGGGAGAGG CTCTGTCTCC TTCCTGCTGC CAGGACCTGT 1140 

GCCTAGACTT AGCATGCCCT TCACTGCAGT GTCAGGCCIT TAGATOGGAC CCAGCGAAAA 1200 

TGTGGCCCTT CTGAGTCACA TCACCGACAC TGAGCAGTGG AAAGGGGCTA TATGTGTATG 1260 
AATAGAOCAC ATTGAAGGAG CACAATGCCC TCCTGTGnTG ATGCCACTTC CCAGGGTQGA 1320

GACAGTOGAA AAGAACOGAG GACAGGAAAG GATTGGGTAG GfTCAAGGGGrT CAGGGGACTG 1380 

CTAGTCACCC AATCTTOGAG AGGTGCAAAA AQCACTGGQG GCTACCOGTT AGC7IGCATCT 1440 

GCCCTGQCTG TTTQCXXX3TT CATCTCACAA ACTGCCACTA CTATGTACCr GCAGrTGGGGT 1500 

TGCAGAGATG GGGGAGACTC AAGrPCTTACT COXAGGAGC TCCCAGQGCC CAAGGAGGAG 1560

AATGCTQCCT CCTTTCftGrTC TQGTCTACAC CCACTTTCTG CTAQCXrTCTC TGCTrCCTGT 1620 

AATTCTGGCT GTTTrrcCflG ACTCAGCTCA AATAGTOCCC CTCCTTAAQC CCATCCCTCG 1680 

CCXXCAGCCT GAGGrrGATCT TTCCCTCCTC TGAACTATTA GAGGAGTTAC TGTCTGTrCA 1740 

GTTCGrrTTGG CAGGCACACA CAGTGGCATA AATTCTATTG TTTTGAACTC TGATT 1795 




(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 858 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
TCTGCTTTCG GTGCTGCGTG TACTGCTGGG CGGCTTCTTC GCGCTCGTQG GGTTQGCCAA 60 
GCTCTCGGAG GAGATCTCGG CTCCAGTTTC GGAGCGGATG AATGCCCTGT TCGTGCAGTT 120
TGCTGAGGTG TTCCCGCTCA AGGTATTTGG CTACCAGCCA GATCCCCTGA ACTAOCAAAT 180 
AGCTGTGGGC TTTCTGGAAC TGCTGGCTGG GTTQCTQCTG GTCATGGGCC CACCGATGCT 240 

GCAAGAGATC AGTAACTTGT TCTTGATTCT GCTCATGATG GGGGCTATCT TCACCTTQQC 300 
AGCTCTGAAA GAGTCACTAA GCACCTGTAT CCCAGCCATT GTCTGCCTGG GGTTCCTGCT 360 
GCTGCTGAAT GTCGGCCAGC TCTTAGCCCA GACTAAGAAG GTGGTCAGAC CCACTAGGAA 420
GAAGACTCTA AGTACATTCA AQGAATCCTG GAAGTAGAGC ATCTCTGTCT CTTTATGCCA 480 
TGCAGCTGTC ACAGCAGGAA CATGGTAGAA CACAGAG?rCT ATCATCTTGT TACCAGTATA 540 

ATATCCAGGG TCAGCCAGTG TTCAAAGAGA CATTTTGTCT ACCTOGCACT GCTTTCTCTT 600

TTTAGCTTTA CTACTCmT GTGAGGAGTA CATGTTATGC ATATTAACAT TCCTCATGTC 660 
ATATGAAAAT ACAAAATAAG CAGAAAAGAA ATTTAAATCA ACCAAAATTC TQATGCOCCA 720
AATAACCACT TTTAATGCCT TGGTGTAAGT ATACCTCTGA ACTTTTTTCT GrTGCCTTTAA 780 
ACAGATATAT ATTTTTTTTT AATGAAAATA AAAOCATATA TCCTATTTTA TTTCCTCCTT 840 
TTAAAACCTT ATAAACTA 858



(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 915 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

GAATTOGGCA CGAGCGCQGC CATQGCGCTC CTGCTTTCGG TtXTOCGTGT ACTC3CTGQGC 60 

GGCTTCTTOG OOCTCGTGGG GTPQGCCAAG CTCTCGGAGG ASATCTOQGC TCCflGTTTCG 120 

GAGCGGATGA ATGCCCTGTT CGrTGCAGITr GCTGAGC3TGT TCCOGCTGAA QC3TATTTC3GC 180

TACCAGCCAG ATCCCCTGAA CTACCAAATA QCTGTGOGCT TTCTGGAACT GCTGGCTGGG 240 

TTGCTCXTQG TCATGGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTCATTCTC 300 

CTCATGATGG GGGCTATCTT CACCITOGCA GCTCTGAAAG AGTCACTAAG CACCTC7EATC 360 

CCAGCCATTG TCTGCCTQGG GTTCCTGCTG CTGCPGAATG TCGGCCAGCT CTTAGCCCAG 420 

ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAAI^XTTGG 480

AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGGTGTCA CAGCAGGAAC ATGGTAGAAC 540 

ACAGAGTCTA TCATCTTGTT ACXAGTATAA TATCCAGGGT CAGCCAGTGT TGAAAGAGAC 600 

ATTTTGrCTA CCTGGCACTG CTTTCTCTTT TTAGCTTTAC TACTCTTTTG TGAGGAGTAC 660 

ATGTTATGCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAOC AGAAAAGAAA 720 

TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780

TACCTCTGAA CTTTTTTCTG TGCCTTTAAA CAGATATATA TTTTITTTWA ATGAAAATAA 840 

AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAAOCTTA TAAACTATAA MAAAAAAAAA 900 

AAAAAAAAAA CTCGA 915 




(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1517 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
CCTTGTGC3CA ACTAGTGQGT COCXCGGGCT GCAGNAATTC CXSGCAGTOGT TCTG M CritriX> 60

AAGATACTCT GftGnTCCTCT GAGAGATCCA AAGGCTOCGG GAGCAGACCC CCAACCOCCA 120 

AAAGCAGCCC TCAGAAGACC AQGAAGAGCC CTCflGCrTGAC CAGGGGTAGC OCTCAGAAGA 180

CCAGCTGTAG CCXTTCAGAAG ACCAGQCAGA GCCXTPCAGAC GCTGAAGCGG AGCOQAGrTGA 240 

CXACCTCACT TGAAGCTTTG CXXIACAGGAC AGrTQCTGACA GACAAGAGTC GGCGACAGTG 300 

GAAGCTGAAG TCCTTCCAGA CCAGGGACAA CCAGGGCATT CTCTATGAAG CTGCAOXAC 360 

CTCCACXXTC ACCTGTQACT CAGGACCACA GAAGCAAAAG TTCTCACTCA AACTQGAOXSC 420 

CAAGGA1X3GG CGCTTGTTCA ATGAGCAGAA CTTCTTCCAG CX3QGCOGCCA AGOCTCTGCA 480

AGTICAACAAG TQGAAGAAGC TGTACTCGAC CCCACTQCTG GCCATCCTTA CCTGCATOGG 540 

TTTCGGTGTT CACCAQGACA AATACAGGTT CTTGGTCTTA CCXAGCXTTOG GGAGGAGCCT 600 

TCAGTCGGCC CTQGATGTCA GCCCAAAGCA TGTGCTGTQC AGAGA0GTC7P CTGCTGCAQG 660 

TGGCCTGCCG GCTGCTGGAT GCCCTGGAGT TCCTCCATGA GAATGAGTAT GTTCATQGAA 720 

ATGTGACAGC TGAAAATATC TTTGTQGATC CAGAGGACCA GAGTCAGGTG ACTTTGGCAG 780

GCTATGGCTT CGCNTTCCGC TATTGCXX^VA GTTGGCAAACA aSTOGCCTAC GTGGAAGGCA 840 

GCAGGAGCXry TCACX3AGGGG GACCTTGAGT TTCATTAGCA TC3GACCTGCA CAAGGGATGC 900 

QQQCCCTCCC GCCGCRGYGA CXTTCCAGAGC CTGQGYTAMT QCATGCTGAA GTGGYTCTAM 960 

GGGrrrrCTGC CATQGACAAA TTGCCTTCCA AMAMTGAGGA CATCATGAAG CAAAAACAGA 1020 

AGTTGCCTTG GGAITCATTT TAATGTAAGC TKGACTTPGT CATGCCAGAA ACAAGGCTCG 1080

GICAOCGTCA GCAGTTTGCA GTTTTCCACC TCCWCCX»GT TCCTCCX5TGT GGrTPGACCCA 1140 

GATATCTCCG TTATGCAGCC GCCTCCGGGG GACCACXTCC CTCCCTTTGA GTCAGCXACA 1200 

GACAGCCTAC TTGACGGCXX: CGCTGGCCCC CACATTCCAC TGAACTGrTGC GGATQCXaCA 1260 

GTCACCCCCT CTCAGGCACA GCAIGACCTC CTGAAGrrCGA GCCTGCTTQC TTTGAACCTA 1320 

CCAGTTAAAA TCTOCTCAAA ATGrTTTGGAT ACCGCCCATT GGCCCCTCAC AGCCAOGAGC 1380

TCCCTGACCA GTGTGCCTGT &TGTGTG1OT C?rGTGrrCTGT GTGTGTGCTr GGGACGGGTG 1440 

GGGAGGTCAC CTTTQQGTOT GCGGrTGTGCC CCCAGGACCT GTAAGTAATA AAATCTTTAT 1500 

TTCCAAAAAA AAAAAAA 1517 

(2) INFORMATION FOR SEQ ID NO: 289:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3865 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

Tt3GftQGC3GC5G GAGCTTCCTT GAGCAGTQGG CCCAGQCCTC QCCCTCCACA CTTCATTCTC 60 

TGACCTTTCT CTCTCCTCAT TTCQGTGCAT GrPCCTTTCTG CAGCTQOCTT TCAGCACAQG 120 

TQGTTCCACT GGGGGCAGCT AACGCTGAGT GACAAGGATG QGAAGCCACA GGTQCATTTT 180

ACTCAAGICT TCTCTAGTCA ATGAGGGGCA CXXAGTGCTT CTAGGGCAGG CTOGGTGGTC 240 

GTCCOCTAGG TATCAGCCTC TCTTACTGTA CTCTCCGCSGA MCTTAACCT TTCTATTTTC 300 

AGCCTCTGCC ACCTOTCTAG GCAflGCTQC5C TTCCCCATTG GCCCCTGTGG GTCCACftGCA 360 

GOGTOGCrSC CCCCCAGQGC CACCXXTITCT TTCTTGATCC TCTTTCCTTA ACAGTCACTT 420 

GGGCTTGAGT CTTGGCAAGGA ACCTTGCTTT TAGCTTCACC ACCAAGGAGA GAGGTTGACA 480

TGACCTCCCX: GCCCCCTCAC CAAGGCTGGG AACAGAGGGG ATGTOCTTGAG AGCCAGGrTTC 540 

CTCTGGCCCT CTCCAGGGTG TTTTCCACTA GTCACTACTG TCTTCTCCTT GTAGCTAATC 600 

AATCAATATT CTTCCXTTGC CTGTOGGCAG TWGGflSAGTG CTGCTGQGTG TACGCTQCAC 660 

CTGCCCACTG AGTTGGGGAA AGAGGATAAT CAGTGAGCAC TGTTCTGCTC AGAGCTCCTC 720 

ATCTACCCCA CCCCCTAGGA TCCAGGACTG GGTCAAflGCT GCATGAAACC AGGCCCTGGC 780

AQCAACCXTO GGAATGGCTG GAQGTGGGAG AGAACCTGAC TTCTCTTTCC CTCTCCXTTCC 840 

TCCAACATTA CTQGAACTCT ATCCTCTTAG GATCTTCTGA GCTTCTTTCC CTGCTOGGTC 900 

GGACAGAGGA CAAAGGAGAA GGGAGGGTCT AGAAGAGGCA GCXXTTCTIT GTCCTCTGGG 960 

GTAAATGAGC TTGACCTAGA GTAAATGGAG AGftCCAAAAG CCTCTGATTT TTAATTTCCA 1020 

40 TAAAATGrrrA GAAGTATATA TATACATATA TATATTTCTT TAAATimG AGTCTTTGAT 1080 

ATGTCTAAAA ATCCATTCCC TCTGCCCTGA AGCCTGAGTG AGACACATGA AGAAAACTCT 1140 

GrrrrCATTTA AflGftTGTTAA TTAAATGATT GAAACTTQGC TGrTOGCTACT GCTTCTTAAT 1200 

GITGGGGGGA CflGGGCAGTG GTCTGGGCCC ACATTTAGAA GGGAAAATGT TTTGCCTGCT 1260 

GCACACATTG GACCCAAGTA TGGGCCTCTT CTGCCTAGTA CPGCCAAAGG GACTGTTAAG 1320 

50 GTGTCTTGTC CATCTTCTAC CCXXXZACCCC CCATTAOSGG TAAAGGRAAC CCCAGACTAG 1380 

GrrGAGGOGCC AGCAQCTGCC TCACATTGTG TTCTCTCCTG AGATC3GTCCA GCTCACATCC 1440 

AGACACCTTG TTCAGACATT rTATTTGAAT TTATQACAGT GATGGGGATT TGACPQAGAT 1500 

GCCTTATC3GA GAAGTACCCC AOXTCTATG AAGACAGAAT CACPCTCTGC CATTCAITCT 1560 

GCCTGATGCT AACAACACGC AGCTGATITA GGGAGTGTCC CAGCCTAGCT GGATCAAGGG 1620 

60 AAATTCCAGG AGCXXTGGGG CAGGCCCTGG NCXXXIAGTGC CAAGCCTCAG AGTAAGCAGA 1680 
CATTGQGAAA GTTQCCAACC ACTPGGrPAGA CCACTAGGTT CTCTCITTTC C XTi 'itXV i ' rr 1740

CCTTTTCAAA TCCCACAGTT TCCTGTK3QG GAGAAGCTGT AATTAGCCTA GTCCAGGTAC 1800 

CAGATCCCAG CTAC3QGGCGC AOCTGNCTTG GATAACTCCA AfiAAAACCTG GtSCAOCAGTA 1860 

TTTTTCCAAT TATAAGGACT GTQGCATAAA TnTTAAATG AGTTATATTG AAACCAGATT 1920 

TCTCCAGCTG CCAAGQGAAG AAGGTAGGGC TQGACTOCCT GCTOTQGCCC AGCXXTnGTT 1980 

AGGGGmOGT CTCTGACTGC AGCCAGACAG GATGATCCTG QGTTCTQGGG AGGGTAAGCT 2040 

GCCCCTTQCC GAGTTCTGCA CX^GAATAAAG AGTOCAAACC CGCTQCTTCC GTOTCCTGAG 2100 

AGATGGGTAA ATGGGTCATG GATGGAGCAG ACTGAAGAGA CAGCAGATGA CTCAGTGGTC 2160 

GAAGAAGQGG GGAAGATQCT GGGCTGGCTA GCTAATGTTC CCCCCTTTCA GCGATTTACA 2220 

QGAAATGGAG CXX:AGCTTGG TGAIGAAGTT GGTTTGCTTC CACTGTQCGA TCCACTCCTC 2280 

AGAAATTTTG AAGTCAGCCT GCAACTTCTC GAAGACTTTC TTCTTGGGCT TGAGCTOCTC 2340 

ATCTGGTTGG CCCTTTTCAT AGCCCTTCAC AAACACCTGC TCACCAGGAG CAGAGCCTOC 2400 

CGGAGGGTCC AGAGGTTCAA CTGGCGGTTT ATCCCTTCTA TAGAAGCACA CAGAAGCATG 2460 

CCTTGQGACT CGACTCCTCT OVTCTTCTGG GGTTTCAGGT TGCACAGCAC CACTACCAGC 2520 

CTGTCCTGCA GTTCCTOCTT GQQCACGAAC TGTACCAGGC CGCTCACCAC AGTCCGTCGT 2580 

TCAGCTTCCC CCAOGTCAAT CTTCTCTACA TACAGGCTGT CTGCATCrGG GTGCTTCTCX: 2640 

ACAGTGATOA TTTTCCCCAC ACX3GATATCC AGCTGGGATG GGATGACCTC CTCTQGTTCT 2700 

GAATTCTTGG CAGGCCTTTG GCXMTGGCT TCTGCTTTGA GGGATCTGGG TAGGCAGCGC 2760 

TGQCXAGTTT TTTCAGGGCA GGQGTAITAA ACTTTTCCCG GATTQGATCC AGCAACTTGrr 2820 

TCAGTGCXSAC TTCAACAGAA TTCTTCAGGT CTCCAQGATG TACAACCTCA GCAGCAAAGT 2880 

CXOTTTCCAG GTCCACGTAA GCTGTGTAGG TTTTCnTTCC ACCCCATTTC TCATCTOGTA 2940 

GGATCACAAA CTCGGACTTA AGGGGAAAAA GGACATGCTT GATGAAGGAC AGAACCXXAT 3000 

TGTTCTCCAC ATTTCCTQGC TCACAGAAGG CCTTCTTCAG ' riTrriX, - riXJ ACATCCTCCT 3060 

TCCGATCAAG GAGATCAATC TTQGACTCCT CTTCTGAAGA GCTCATTTTG CTGCCTGTTA 3120 

ATCCTQGAAC CATAGGATTC ATCAGATGGA CCCGTTTTGA ATAGCXAAGT GCAGGGAGGT 3180 

ACTTCTCTGC AAAGGTGAAA ATCTTTCTCT GATCAATGCC TCCAAATTGG GCATCTACTT 3240 

TTAAATACTC TTCATCCAAA GCCTOCAGTC CX3GGGTATAA GAGGCCACTC AGCAAAGGGT 3300 

QCTCCACCTG CTTTACCACC TCAGCTCCAG CCTTCITOGA ATCGTCCTGrr GTTGACCACGG 3360 

AGGAGAGTCT GTACACATCT AGrPGTCTACT CTTTGCTGAG CTGGTAATCA GrTGCCTTTGA 3420 

TGAACTTGAG CTTCTCCAAG GGCACACCAA TGCnCTCCAG CATTGCTTTG ATCACATTCT 3480 
CATAGrrAACT GACTCGGAGT TCTAGAAGTT CCCATGGQCX: TrrCATCTTA TCCAGGTATC 3540

^ OGTOGAGCTC CGCAAACAGA ATTGTTACCT CACACCCTOC CTITAAGAAG TCTOCAATCT 3600 

1TCACATOQG CACAAAC3TAA GOCACATC3TC CnTTOCCOGT U Jl ' llj CC GVl ' CCCCAC5EAAA 3660 

TTTTAAGTrC CCGCTCCTTC AGTATCTCCT TCatfSCTTCTC ITCCCCCAGA ACCTCCTOCA 3720 

10 GGTTCCGGGT GATAAGGrrC5C AOmCTCTT CAGGGCTGGG AGCGTCCCCC ATOC3TXXGCT 3780 

AOCCCTOCTT CCCOCGCTCA GCXOGGCACC AGAGCCCCTT CXTOGCSTCAC CGTCGCXXXT 3840 

GCGTGCCGGG AACICTCACG CXSAGT 3865 






(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1910 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

Aa3C3AGAGGA C3GAGAGGGGG TCTGCGCC3CG GCCGCTACCC AGAAGCCAGC C5GACGGCAGC 60 

ACGGAGTQC3G CTCTCCCCGA GCCCAGCCCC GAGCGAGCCC CCCCCCCGCC CCCC31AQGAC 120 

GCX3CCTYCCA GCCAGCCCGA CTCCTAGGAG GAGQC3GAGGC QGGAAAGCAG CTCAAGCCTC 180 

35 ACCCACCGCC CTGCCCOCAG CCCCGCCACT CCCAGGCTCC TOGGGACTCG GCGGCTCCTC 240 

CTGGGAGTCT CGGAGC3QC3AC OGNCTC3TQCA GACGCCATGG AGrrPGGTCCT GGTCTTCCTC 300 

^ TGCAGCCTGC TC5GCCCCCAT GGTCCTGC3CC AGTQCAGCTG AAAAGC5AGAA GGAAATOSAC 360 

CXrmrCATT ATGATTACCA GACCCTQAGG ATTGGGGGAC TCGTCTTCGC TCIXaGTCCTC 420 

TTCTOQGTTG GGATCCTCCT TATCCTAAGT CGCAGGTC3CA ACSTOCAGTTT CAATCAGAAG 480 
CCCCGGGCCC CAGGAGATGA GGAAGCCCAG CSTOGAGAACC TCA3XJVCCGC CAATCCAACA 540
GAGOCCCAGA AAGCAGAGAA CTGAAGTCCA GCCATCADGT GGAAOCCTCT GGAACCTCAG 600
GCGGCTQCTT GAACCTITGG ATGCAAATGT CGATGCTT7A GAAAACCGGC CACTTCAGCA 660
ACAGCCCTTT CCCCAGGAGA AGCCAAGAAC TTGTCJICTCC CCCACCCTAT CCCCTCTAAC 720
ACCATTCCTC CACCTGATGA TGCAACTAAC ACTTGCCTCC CCACTOCAGC CTCCCSCTCCT 780
GCCCACCTCC CGTGATGrrGT GTGTGrPGrTGT GrrGTGTGTGT GACTOTCnCT CJITTCCTAAC 840
TGTGGnCTrr GTGGCTACTT GTTTGTGGAT GGTATTGTGT TTGTrAGTGA ACTOK3GACT 900

CGCTTTCCCA GGCAGGGGCT GAGCCACATG GCCATCTGCT CCTCCCTCCC CCCGTOGCCC 960 

TCCATCACCT TCTQCTCCTA GGAQGCTGCT TGTTGCCCXav GACCftGCCCC CTCOCCTCAT 1020

TTAGGGA3TX: GTAGGGTAAG AGCACGGGCA GrrGGrTCTTCA GTO3TCTTGG GACCTGGGAA 1080 

GGTTTGCAGC ACTTTOPCAT CATTCTTCAT GGftCTCCTTT CACTCCTTTA ACAAAAACCT 1140 

TOCTTCCrrA TCCCACCTGA TOCCAGTCTG AAGGTCTCTT AGCAACTOGA GATACAAAGC 1200 

AAGQAGCTGG TCAGCCCAGC GTTGACGTCA GGCAGGCTAT GCCCITCCGT GGTTAAITIC 1260 

TTCCCAGQGG CTTCCACGAG GAGTCCCCAT CTGCCCOGCC CCTTCACAGA GCX5CCCGGGG 1320 

ATTCCAGC3CC CAGGGCTICT ACTCTQCCCC T0C3GGAATCT GTCCCCTCCA TATCTTCTCA 1380 

GCAATAACTC CATGGGCTCT GCSGACXXTTAC CCCTTCCAAC CnCCClCCT TCTGAGACTT 1440 

CAATCTACAG CCCAGCTCAT CCAGATGCAG ACTACAGTCC CTOCAATTCG GTCTCTOGCA 1500 

GGCAATAGTT GAAGGACTCC TGrTTCOGrTTG GGGCCAGCAC ACCGGGAJQG ATOGftGGGAG 1560 

AGCAGftGCXX: TTTGCTTCTC TQCCTAOSTC CCCTTftGATG GQCAGCAGAG GCAACTCCCG 1620 

' CATCCTITCC TCTQCCTC3TC GCSTOGTCAGA GCGGTGAGCG AGC5TQGGTTG GAGACTCAGC 1680 

AQQCTCCGTG CAGCCCTTGG GAACAGTGAG AGGITGAAGG TCATAAOSAG AGTOGGAACT 1740 

CAACCCAGAT CCCGCCCCTC CTGrPCCTCTG TCTTCCCXSOG GAAACCAACC AAACCCJIGCG 1800 

CTGTGACCCA TTGCTGITCT CICTATOGTG ATCTATCCTC AACAACAACA GAAAAAAGGA 1860 

ATAAAATATC CTTTGTTTCM TAAAAAAAAA AAAAAAAAAA AGQGGGGGGG 1910 






(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3276 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

C30GACCGTCG TTTGAGTCGT CGCT0CCX3CT GCCGCTQCCA CTGCCACTGC CACCTCGCGG 60 

ATCAGGAGCC AGCGTTGrrrC GOCCGACGCC TCGCTCCCGG TGGGAGGAAG CGAGAGGGAA 120 

GCCGCTTGCG GGnTGTCGC CGCTGCTCGC CCACCGCCTG GAAGAGCCGA GCCCCGGCCC 180 

AGTCGGTCGC TTGCCACCX3C TCX3TAGCCGT TACCCGCGGG CCGCCACAQC CGCCGGCCGG 240 

GAGAGGCGCG CGCCATOGCT TCTQQAGCOG ATTCAAAAGG TGATGACCTA TCAACACCCA 300 

TTCTCAAACA GAAGAAOCGT CCCAATCGGT TAATTGTTGA TGAAGCCATC AATCAGGACA 360 

ACAGTGTGGT GTCCTTGTCC CAGCCCAAGA TGGATGAATT GCAGTTGrrTC CX^AGGTCACA 420 

CAGTGTTGCT GAAAGGAAAG AAGAGAOGAG AAGCICTITG CATOGTCCTT TCTGATCATA 480 



CTTGTICTCA TGAGAAGATT CGGATGAATA GAGTrGTTCG GAATAAOCTT CGnXTEAOSCC 
TAGCSGGftaXST CATCAGCATC CAGCXATOCC CTGATGTGAA GTACGGCAAA OGTATCCATC 
TGCTQCCCAT TGATGACACA GTQQAAGGCA TTACTOGTAA TCTCITCGAG CTATACCITA 
AGCOGTACTT CCTGGAAGCG TATCX3VCCCA TCCX3GAAAQG AGACATTTTT CViXJfiXXJJiX i 
GTGGGATGCX; TGCTGTGGAG TTCAAAGTOG TGGAAACAGA TCCTAGOCCT TATTCCArTC 
TTCCTCCAGA CACAGTGATC CACTOCGAAG GGGAGCCTAT CAAACGftGAG GATGAGGAAG 
AGTCCTTGAA TGAAGTAGGG TATGATGACA TTGCTrGGCriXS CAQGAAGCAG CTAGCTCAGA 
TAAAGGAGAT GGTGGAACTG COCCTCAGAC ATTCCTOCXXn' CTTXAAQGCA ATTGGTGIGA 
AGCCrCCTAG AGGAATCCTG CTTTACGGAC CTOCTOGAAC AGGAAAGACC CTGATTOCTC 
GAGCTGTAGC AAATGAGACT GGAGCCTTCT TCTTCTTGAT CAATOGTCXTT GAGATCATGA 
GCAAATTGGC TGGTGAGTCT GAGAGCAACC TTCGTAAAGC CTTTGAGGAG GCTGAGAAGA 
ATGCTCCTQC CATCATCTTC ATTGATGAGC TAGATGCCAT CX3CTCCCAAA AGAGAGAAAA 
CTCATQGOGA GGTGGAGCGG CGCATTGTAT CACAGTTCTT GACCCTCATG GATGGCCTAA 
AGCAGAGGGC ACATCTGATT GTTATGGCAG CAACCAACAG ACCCAACAGC ATTGACCCAG 
CTCTACX3GC3G ATTTGGTCGC TTTGACAGQG ACSCrTAGATAT TGGAATTCCT GATGCTACAG 
GACGCTTAGA GAITCTTCAG ATCCATACCA AGAACATGAA GCTGGCAGAr GATGTGGACC 
TGGAACAGTA GCXilAATGAGA CTCACGQGCA TCTGGGTCCT GACTTAGCAS CCCTGTOCTC 
AGAGGCTGCT CTOCAAQCCA TCCGCAASAA GATGGATCTC ATTGRCCTAG AGGATGAGAC 
CATTGATGCC GAGGTCATGA ACTCTCTAGC AGTTACTATG GATGACTTCC GGTOGGCXTTT 
GftGCCAGAGT AACCCATCAG CACTGCGGGA AACX3GTQGTA GAGGTGCCAC AGGrTAAOCTG 
GGAAGACATC GGGGQCCTAG AGGATGTCAA ACGPGAGCTA CAGGAGCTGG TCCAC3TATCX: 
TGTGGAGCAC CCAGACAAAT TCCTGAAGTT TGGCATGACA CCITCCAAGG GAGTTCTGTT 
CTATGGACCT OCTGGCTGTG GGAAAACTTT GTTGGCCAAA GCCATTOCTA ATGAATGCCA 
GGCCAACTTC ATCTCCATCA AGGGTCCTGA GCTGCTCACC ATGTGGTTTG GGGAGTCTGA 
GGCCAATOPC AGAGAAATCT TTGACAAOGC CCGCCAAGCT GCCCXXTCTG TCCTATTCIT 
TGATGAQCTG GATTCGATTG CCAAGGCTCG TGGAGGTAAC AITQGAGAIG GTGGTQGGGC 
TQCTGACXXSA GTCATCAACC AGATCCTGAC AGAAATGGAT GGCATGTCCA CAAAAAAAAA 
TGTOITCATC ATTGGCGCTA CCAACCGGCC TGACATCATT GATCCTOXA TCCTCAGACC 
TQGCCX?rCTT GATCAGCTCA TCTACATCCC ACTTCCTGAT GAGAAGTCCC GTCTTCOCAT 
CXTOIAQGCT AACCTGCXSCA AGTCCCCAGT TGCCAAGGAT GTGGACTTGG AlSTTCCrGGC 






TAAAATGACT AATGGCTTCT CTOGAGCTGA CCTGACAGAG ATTTOCCAGC GTCCTTQCAA 2340 

^ GCTGGCCATC CGTGAATCCA TCGAGAGTGA GATTAGQOGA GAAOGAGAGA GGCAGACAAA 2400 

CrCATCflGCC ATQGAQGTAG AAGAGGATCA TCCAGTOCCT GAGATCOGTC GAGATCACTT 2460 

TOAAGAftGCC ATGCGCTTTG OOCGCCGTTC TCTCAGTGAC AATGACA3TC GGAAGTATCA 2520 

10 GATOTITCCC CAGACCXTTC AGCAGAGTCG GGGCTTTGGC AGCITCAGAT TCCCTTCAGG 2580 

GAACCAGGGT GGAQCTQCSCC CCAGTCAGGG CAGTGGAGGC QOCACAGGTC GCAGTGTATA 2640 

CftCAGAflGAC AATGATGATG ACCTGTATQG CTAAGTGGTG GTQGCCAGOG TGCAGTGAGC 2700 

TGGCCroCCT GGACCTTGTT CCCTGGQQGT GGGQGCGCTT GCXXIAGGAGA GOGACCAGQG 2760 

GFTGOXCCAC AGCCTGCTCC ATTCTCCAGT CTGAACAGIT CAGCTACACT CTGACTCTOG 2820 

ACAGGGGGTT TCTGTTGCAA AAATACAAAA CAAAAGCGAT AAAATAAAAG CGATTTTCAT 2880 

TTGGTAGGCG GAGAGTGAAT TACCAACAGG GAATTGGGCC TTGGGCTATG CCATTTCTGT 2940 

TGTAGTTTGG GGCAGTGCAG GGGACCTGTC TOQGGTGrPGA ACCAAGGCAC TACTGCCACC 3000 

TGCXIACflGTA AAGCATCTGC ACTTGACTCA ATGCTGCXXX5 AGCCCTCCCT TOXCCTATC 3060 

CAACCTGGGT AGGTGGGTAG GGGCCACAGT TGCTGGATGT TTATATAGAG AGTAGGTTCA 3120 

TTTATTTTAC ATGCTTTTGA GTTAATCTTG GAAAACTAAT CACAAGCACT TTCTAAACCA 3180 

AAAAATGACA TGTTGTAAAA GGACAATAAA CGTTGGGTCN AAATOGGWRA AAAAAAAAAA 3240 

AAAAAAGGGG GGCCCCTCTA AAGNNCCANN CTTCGT 3276 



(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

TTGCAATQGT TGAATTCCCC TCCTCACGCC AGCCTAGGAG AAGAAGTTCG TAGICCCAGA 60 

C3CTGAGGCAG GAGGCGGCAG TTTCTGGCGG GTCAGGGCGG AGCTGAAGTG ACAGCGQAGG 120 

CGGAAGCAAC GGTOGGTQGG GCGGAGAAGG GOGCTGGCCC CAGGAGGAGG AGGAAACCCT 180 

TCCGAGAAAA CAGCAACAAG CTQAGCTGCT GTOACAGAGG GGAACAAGAT GGCGGCGCCG 240 

AAGQGAGCCT CTQGGTGAGG ACCCAACTGG GGCTCCOGCC GCTGCTGCTC CTGACCA3X3G 300 

CCTTGGCCGG AGGTrCGGGG AOCGCTTCGG CTGAAGCATT TGACTCGGTC TrGGGTGATA 360 
CGQOGTCTTG CCMXGGCXC TG?rCAGTTGA OCTACCCCTT GCACACCTAC CCTAAGGAAG 420

AGGAGTTGTA CGCATC3TCAG AGAGGnTOCA GGCTGnTTTC AATITCrCAG TTTCTCGATG 480 

5 ATGGAATTGA CTTAAATOGA ACTAAATTOG AATGTGAATC TOCA3X?EACA GAAGCATATT 540 

CCXyUVTCTGA TGAGCAATAT GCTTQCCATC TTOCTTQCCA GAATCAGCTC CCATTCGCTC 600 

AACTOAGACA AGAACAACTT ATGTCCCTGA TGCCAAAAAT GCACCTACTC TTTCCICTAA 660 

CTCTGGTGAG GTCATTCTOG AGTGACATGA TC3GACTCCGC ACAGAGCTTC ATAACCTCTT 720 

CATGGACrrT TEATCTTCAA GCCGATGACG GAAAAATAGT TATATTCCAG TCTAAGCCflG 780 

15 AAATOCAGTA CGCACCACAT TTOGAGCAGG AGCCTACAAA TTTGAGftGAA TCATCTCTAA 840 

GCAAAATGTC CTATCTOCAA ATGAGAAATT CACAAGCX3CA CAGGAATTTT CITCAAGATC 900 

GAGAAAGTGA TGGCTrnTA AGATGCCTCT CTCTTAACTC TOSGTOGATr TTAACTACAA 960 

CTCTTGrrCCT CTCGGTGATG GrEATTGCITT GGATTTGTTG TX3CAACTCTT GCTACAGCTC 1020 

TOGAGCAGTA TGTTCCCTCT GAGAAGCTGA GTATCTATGG TCACTTOGAG TTTAatykATC 1080

AACAAAAOCT AAACAGATAT CCAGCTTCTT CICTrGTQGT TCTTAGATCT AAAACTGAAG 1140 

ATCATGAAGA AGCAGGGCCT CTACCTACAA AAGTCAATCT TQCTCATTCT GAAATTEAAG 1200 

CATTITTCTT TTAAAAGACA AGTGTAATAG ACATCTAAAA -ITCCACTCCT CATAGAGCTT 1260 

TTAAAATCGT TTCATTQGAT ATAGGCXTTTA AGAAATCACT ATAAAATOCA AATAAACTPA 1320 

CTCAAATCTG TGAAGACTGT ATTTGCTATA ACTTTA1TQG TATTGTTTTT OTAGTAATTr 1380 

AAGAGGTOGA TGTITCGGAT TCTATTATTA TTTTACTAAT ATCTGTAGCT ATTITCiriT 1440 

TTGCTTPQGT TAITGrrrrTT TTCCCTTOTC TTAGCTATGA GCTGATCATT GCTCCTTCTC 1500 

ACCTCCTGCC ATGATACTGT CAGTTACCTT AGTTAACAAG CTGAATATTT ACTAGAAATC 1560 

ATCCTTCTGC TCAGGAATQG CCCACAAATC TGTAATTTGA AATTTAGCAG GAAATGACCT 1620 

TTAAtXSACAC TACATITPCA GGAACTGAAA TCATTAAAAT TTTATTTGAA TAATTAAAAA 1680 

AAAAAAAAAA AANCT 1695

(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:



CACTTTCAGC 



TCTCTTTGCT TCTACCTCAA ATAGCCCCAG GAGrTGGQCTT 
TAGTCTCCAA rATGGAGCAT CTCAAGCTTC TCCTCQGGGA TOGGGATTOG GATOC3GCAGA 120

^ ATCTGT-rPTG GWTCTCCGGG TTATITOCAG TGOC?rGTAAA AGCAGAGCTC GGCCTPTCCC 180 

TCTCTTATCC CTGiAGGC5TOG GTAAGAAGGA CICTATCTAC ACCTOTTCTT CCCTACCTIC 240 

TCTnTCTTA GGGAGGOCTC ATTCTAAGrrT CXTTCAAGAGA GmXTTCGCT TAAAGCTOTA 300 

10 GCAAGQGTGT GCTAGGTGGG GGATTTOGAG CAAAACOGTC GAGTAGGCAT GATACTGGTA 360 

TOGAGTOGGC CTGCAAAATC AGACAGAAAT GGCITCAGAA GCXX3CAGGGG AGCATOCCTG 420 

TCTCTCfiGTG AIStfJAGlATG GGAGGGACCT CCCTAGCTTC GAAAATGfiGA ATTCAAGGGG 480 

TTATGAACAA ATAGGATGCX: TAGTTGAGGA TGTTCCCAAA GrmTOTCCA ATCTTASCAT 540 

TAGTAGATTT TATAAGCCAC AGAGACAAAC CAGAAACGGA ATAA3T7ITAC TITOGATOCT 600 

20 TrAi-ri-i-i-rr GTTCTAOGTG TOGCTTPGrTA CATQCAGAAG AATGCTATAT GCTOCACATT 660 

TTGCCTITAA AGTCTTACGA CTTTCCCCAT TTrAGTCTAA TCGGAAGATA CAQAIXTTOCA 720 

^ AGrPCTGCTTT TTTCrTTTTT GTTATTATTT TTTTTlTrTT GCTCTCTGnT ATGGACATTT 780 

TCAGACATGC ACAGAAGTTGG AGAGGATGGT CCTTOGACCC MATCTTCTCCA TCACCTAGCT 840 

CSCATCACTTA TCAGCTATGG TCAACCTGGT TTCATCTGTA TCTCTCTCTT ITCACCTCTA 900 

30 TTOTrTATTG AAAATCXAAG ACACTATGCX: AATQCAACCG TGACTACTTT GGGAGATTGG 960 

TAGTCTCTTT TGATGGTCAT AGTGAJPGGGG TGCACTATCA TAATtlACATC AGCTTCTCCTT 1020 

TTTGCrrrrA ATGTTAACTA ATQAAGTTCC AGAGATQGGC CTTAGAAATC TCTTPTAAGA 1080 

ATTAACAAGG AGTCTCAAAA AGAAATGAGA GGGATQCTTC CTTTNCCCTT GCATCTACAA 1140 

AACMAGAGAG AGACTGTTCT GTTCTAAAAC TCTTTCAAAA ATTCTGATAT GGTAAGCJTAC 1200 

40 TTGAGAC3CCT TCACCAGAAT GrTCAATCTTT TTTTCTGriCT AACATCGAAA CTIGTOTCAC 1260 

CATTAGCArr GTTATCAGCT TCTACTQCJrC TCATAACTCT GCJITTTOGAA GAATAATITC 1320 

GAAATTGnTC CTOrGTTCTG TGAAAAIAAC CTCCCCAAAA TAATTAGTAA CTOGriTCnC 1380 

TACTTQGTAA TTTGACACCC TGTTAATAAC GCAAITATIT CTCTCrrTCTr AAACACTATA 1440 

AATAGrPICTA AGTITGCATG CATGATOGAA AAATAAAAAC CTGTATCTCT GTTAAAAAAA 1500 

(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2683 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

5 TGANTCTOGT COOGGGTOCN GATTQGCACN GCCTCCGCCG COGCTOCTraG TTGTCCCGCC 60 

ATOGCACTGT OGOQQGGQCT GCCOCGGGAG CTQGCTGAGG OGGTOGCXXSG GQGCCGGGTR 120 

CTOGTOGrrOG GOGCGGGCQG CATOGGCTGC GAGCTCCTCA AGAATCTCGT GCTCACCGGT 180

TTCTCCCACA TCGACCTGAT TGATCTQGAT ACTATTGATG TAAGCAACCT CAACAGflCAG 240 

TTTTTGTTTC AAAAGAAACA TGTTGGAAGA TCAAAGGCAC AOGriTGCXlAA GQAAAGTCTA 300 

15 CTGCAGrrTTT ACCCXEAAAGC TAATATOGTT QCCTACXTITG ACAGCATCAT GAACCCTCAC 360 

TATAATGTGG AATTTTTCCG ACACTTTATA CTGGrTTATGA ATQCTTTAGA TAACftGAGCT 420 

QCCCGAAACC ATGrTTAATAG AATCTGCCTG GCAGCTGATG TTCCTCTTAT TGAAAGTOGA 480 

ACAGCTGQGT ATCTTQGACA AGTAACTACT ATCAAAAAGG GTCTCACCGA GTGTTATGAG 540 

TGTCATCCTA AGCCGACCCA GAGAACCTTT CCTGQCTGTA CAATTCGTAA CACACCTTCA 600 



GAACCTATAC ATTGCATCGT TTGGGCAAAG TACTTGrTTCA ACCAGTTCTT TGGQGAASAA 660

GATGCTGATC AAGAAGTATC TCCTGACAGA GCTGACCCTG AAGCTGCXTTG GGAACCAACG 720 

GAAGOrGAAG CXAGAGCTAG AGCATCTAAT GAAGATGGTC ACATTAAACG TATTTCTACT 780 

AflGGAATGGG CTAAATCAAC TGGATATCAT CX:AGnTNAAA CTTTTTACCA AGCTTTTTAA 840 

AGATGACATC AGGTATCTGT TGACAATGGA CAAACTATGG CGGAAAAGGA AACCTCCAKT 900 

35 TCCGTTGGAC TGQGCTGAAG TACAAflOTCA AGGflGAfiGAA ACGAATGCAT CAGATCAACA 960 

GAATGAACCC CAGTTAGGCX: TGAAAGACCA GCAGGTTCTA GATGTAAAGA GCTATGCAOG 1020 

TCTTTTTTCA AAGAGCATCG AGACTTTGAG AGTTCATTTA GCAGAAAAGG GGGATPOGAGC 1080 

TGAGCrCATA TGGGATAAGG ATGACCCATC TGCAATGGAT TTTGTCACCT CTGCTGCAAA 1140 

CXTTCAGGATG CATATTTTCA GTCATGAATAT GAAGAGTAGA TTTGATATCA AATCAATGGC 1200 

45 AGGGAACATT ATTCCTGCTA TTGCTACTAC TAATGCACTTA ATTGCTQGGT TGATAGTATT 1260 

GGAAGGATTG AAGATTTTAT CAGGAAAAAT AGADCAGTGC AGAACAATTT TTTTGAATAA 1320 

ACAAOCAAAC CCAAGAAAGA AGCTrCTTGT QCCTTGrTGCA CPGGATCCTC CXIAACCCCAA 1380 

TTGTTATGTA TGTGCCAGCA AGCXMtAGGT GACTOPGCGG CTGAATGTCC ATAAAGTGAC 1440 

TGnrCTCACC TTACAAGftCA AGATAGTGAA AGAAAAATTT GCTATGGTAG CACCAGATGT 1500 

55 CCAAATTGAA GATGGGAAAG GAACAATCCT AATATCTTCC GAAGAGGGAG AGACGGAAGC 1560 

TAATAATCAC AAGAAGTTGT CAGAATTTGG AATTAGAAAT GGCAGCXXXX: TTCAAGCAGA 1620 

TGACTTCCTC CAGGACTATA CTTTATTGAT CAACATOCTT CATAGTGAAG ACCTAGGAAA 1680 

GGACGTTGAA TTTGAAGTITC TTQGTGATGC CCCXXSAAAAA GTQGQGSCCA AACAAGCTCA 1740

AGATGCTGOC AAAAGCATAA CX:AATGGQCA GTCATGATOG AGCTCAGCOC TCCACCrcCA 1800 

5 CAGCTCAAGA GCAAGftTCaC CTTCTCATAG PTGATTOCSGA TGAAGAAGAT TCTTCAAATA 1860 

ATGCOGACCrr CATGAAGAAG AGAGAAGCCG CAAGAGGAAA TTAGATGAGA AAGAGAflTCT 1920 

CAGTOCAAAG AGGTCAaSTA TAGAACAGAA GGAAGAGCTT GAaX3ATGTCA TAGCATTAGA 1980 

TTGAACAGAA ATQOCTCTAA ACAGAACCCT CTTACTATTT AGTTTATCTC GGCAGAACCA 2040 

GATTGnTATG TOCTrTGTTC CAAAGQGAAA AAATTCACAG CAGTGACTTG AAAATCATTC 2100 

15 TGCTCCCTTT GAAAGCATTC ATPTTCSCTAG AACTCTTAGA CACATTGCAG TAIXXTOTAt 2160 

TGAAAGTAGG AATATAGrTTT TAAAAACCCT TTGAACAAAG TGTGTGCATA ACCAGTCATG 2220 

AGATAAAACA ACACAA3X3CA TCJITCCCTTT TTAATOTAAA TACCCTTAGG TATCATTAAT 2280 

AGmCAAAA TATTGTOGTT TAGTAAAGriT GATACCTGGT TATAAATAIT ATOCCTTTAT 2340 

TrTTGGCTAG AAGAAGAATT ATTTTTAGCC TAGATCTAAC CATTTTCATA CTCTTAACTG 2400 

25 ATTGAAACAG ATTCAAAGAA CTATCGAGTG CrATGCA3TC AAACTTGTTT tTAAAlXTTTA 2460 

GATGGCACTA TGTATATTAA TGTAAAACAA TGTTAATTTA CTCAAGTTTT CAGmCTAC 2520 

CX3CCTC3GrrAT GTCTGTGTAA GAAGCCAATT TTTGTGTATT GTTACAGTTT CAGGTTATTT 2580 

ATATTCGATG TTTTGTAAAA CTCAAATAAC GACTATACTT ATGGACCAAA TAAATCGCAY 2640 

TGCATTCTKG TKAAAAAAAN NACAGAAAAA AAAAAAAACA AGA 2683 

(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

GGACTCGGGG TGGCTCTAAG GGGCAGGGAT AGGGCTGGGG AGCGCCGGCC TGIGGCCXTTG 60 

50 MXAGCCCCr TCTCGrroCAG GTTCCACCCC GATGCAGGTG GTCACGrTGCT TCACGCGQGA 120 

CAGCTACCTG AOGCACTGCT TCCTCCAGCA CCTCATGGTC GTGCTOrcCT CTCTGGAACG 180 

CACGCXXTCG CCGGAGCCTG TTGACAAGGA CTTCTACTCC GAGnTTQGGA ACAAGACCAC 240 

AGQGAAGATG GAGAACTACG AGCTGATCCA CTCTAGTOGC GrTCAAGTTTA CCTACCOCAG 300 

TGAGGAGGAG ATTGGGGACC TGAOnTCAC TGTQGCCCAA AAGATQGCTG AGCCAGAGAA 360 

GGCXXXAGCC CTCAGCATCC TGCTGTACGT GCAGGCCTTC CAGGTGGGCA TCCCACCCCC 420 
AGGGGCXrCC TGOQCCCCAA GACACTCCTG CTCACCftGCT CXXaGATCTT 480

CCTCCTGGAT GAGGACTGTG TOCACTACCC ACTOCCOGAG TTTGCXIAAAG MXXXCCCCA 540 

GAGAGACAGG TACOQGCTOG AOGATOGCCG CCGOGTCOGG GACCTGGAOC GAGTCCTCAT 600 

GGQCTACCAG ACCTACOCGC AGCCCTCACX: CTCGTrETCG ATCACGTGCA AGGTCATCAC 660

CTCATGGQCA GTGTCACCCT GGAOCACTTT GGGGAGGTGC CAGGTGGCCC GGCTAGAGCC 720 

AGCCAGGGCC GTGAAGTCCA GTQGCAQGTG TTTGTCOCCA CTPGCTGAGAG CAGAGAGAAG 780 

C7PCATCTCGC TGTTQGCPCG CCACTGGGAG GCCCTCTC3TG GOCTGAGCTC CCTGTOGAGC 840 

TCACCQQCTA GCCCAGGCCA CAGCCAGCCT GTCGrTGTCCA QCTTGAOGCC TACTOGGGCA 900 

GGGCAGCAGG CTnTGTOTT CTCTAAAAAT GTITTrATCCT CCCTrFGGTA CCTTAATTTG 960 

ACTGTCCTCX3 CAGAAATGTG AACATGTGTG TGTGTTGTGT TAATICTITC TCATGrTTQGG 1020 

ACrrGAGAAOX; CXrOOGCOCCT CAGGQCTOTT 0GGTC7PGCTG TCAGCCTCCC ACAGGTOGTA 1080 

CAGCCGrrGCA CACCAGTGTC GTGTCTGCTG TTCnGGGACC GrTGTTAACA CXTTGACACTG 1140 

TGGGTCTGAC TTTYTCTTCT ACAOSTOCTT TCCTGAAGrTG TCGAGTCCAG TCCTTTOTTG 1200 

CTGTTGCTGT TOCTGTTGCT CTITGCTCrTG GCATCTTCCT GCTAATCCTG AGGCTGCTAG 1260 

CAGAATGCAC ATTGGAAGCT CCCACCCCAT ATTGTTCTTC AAAGTGGAGG TCTCCCCTGA 1320 

TCCAGACAAG TGQGAGAGCC CXJPGGGGGCA GGGGACCTGG AGCTGCCAGC ACCAAGCGTG 1380 

ATTCCTGCTG CCIOTAITCT CTATTCCAAT AAAGCAGAGT TTGACACCX3W MAAAAAAAAA 1440 

AAAAAAAAAA AACN 1454 



(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:
ACCCTGGCAT GCCCCACAAA CAGATCACCA GCCAGCTTAC ACAGGCATTA ACTCTCCTCA 60
ATGAGGAAGA ATCATPCACA ACTGAGCAAG ACATTCATAT GATCATTTAA GGAAGTGTTT 120
CCCTTATGTG TTAGCAAGTA TAATCGGCTA ACTCCTAAAT CCCAATQAAT AGTCCTAGGC 180
TGGACAGCAA TGGGCTGCAA TTAGGCAGAT AAAGACATCA GTCGCAGTAA ATGAATCCAT 240
AGACTCATCT AGCACCAACT ACCATTAGCA CTATGTTAGG AGCTGCAAGG CCCCAAAGTA 300
GAAGAITCTGC AJAATOTCTG CTCITCrGTA GCTCAGGftGA CAATTCCAGC ACAGACftCTA 360

CAGrTTAACGC TGAACTGCAG CTOCAAGrTAA TAGCAWGAAC AGTCAGAAAA ATAOCTTATG 420 

5 AGGGGQCAGG GCTGAAGCTG GGCCITGAAG GATQGATGAA ATTTGGAnaG AGAATGAGGA 480 

AGACAGAGQG NCTOCAflGTG AGAGAAGCAT GAAAAATGAG CARGGGCCTG GATCAGTC3GG 540 

GTGrrATTCAG AGCACCTYTC CAGATGCACC ATGCATGCTC ACAGTCCCTT GCCTATGTGT 600 

GGCAGAGrrCT CCCAGCCAGA TGrTGrrGCCCC CACXXrATGT CCATTTACAT GTCCTTCAAT 660 

GCCCACCTCA AAAGGYACYT CTPCTGTAAA GCTTTOOCTK GGTATCAGGA ATCAAAATTA 720 

15 ATCAGGGATC TTTTCACACT GCTGTmTT CCTCTTTGGT CXTTTCTArCA CTAAAACTCA 780 

TCPCATPCAG CXTTTACAGCA TAACTAATTA TITGTTTTCC TCACTACA 828 





(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2416 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

TCAATTTCCA TTAACTCAGA TCAGCCATTG TGATTCACCA TTTOTCAGGC TCTCAQGrTTT 60 

AACAAAACCT ACTATCACCA TCATCCTTCA ACAGCCACAG TCTGAATTGA GCCAACATTT 120 

TTTTTTCTTT GAGAAAGAAG TGGACTTGGGG CACAACTTTT AGTCTGAGGG GAGCTAGTGG 180 

AAATCTAGAC AATAGAAGTC ATCGATAGCA GCTTTTCCTC AAATGTGTGA CTCCTCAGGG 240 

40 GCTAAACTGC TCTTAGCTTA GAATTATGCT TTACTAGAGA TCTAGCAGAT AAGTQGGTTA 300 

ATCACTACCA TCCTOTAACT AGrTTATATAG CTTCCAGACA TGAGGGAGAC ATCAAACAGG 360 

GATGGAAGCA ACCCCAAGGA TATQCAAGAA GGGCATQATG AACCCCCTTC CCTCTGGCAG 420 

GAGAACAAG6 CCAACCAAGG GACAGACTOG AAAGCAdTA GATOTTTAAG GAGGAGAAAG 480 

QGGAAGCTTT GACCAGTCCT TGCCTTITGC CAAGTTCAGC CAGnTCTCCG CTGCTTGCAA 540 

50 CCTCTAGCGC AGTAACATTT GCAGAATTGC AGATTTTCCC CCAGATACTA GGAGGAAAGG 600 

GACnTQGQG GGTGGGGAAG GGGTCGTQGT GmTAAAAG CATAAGnTAC CTGrTTQCAC 660 

TGTTTTAAGA TAGGAAAAAA AAATAGTGGG CAAGC7PGAAC ATCAGACGTA AATTTGTGTG 720 

TTTTTArTTT GTCATQCTCT TGAAAATGTT TGAOCATTTG TAGTATACAC AGTGAAACTT 780 

GATTCTCTGT TGCATAAAAC ACTATATTTT TTTGGAAATG TTACrGTCCA AAAGCCTCTT 840 

60 CCCTCCCTTT CCTTTTOCTA TGrTACTTCCT TCATACTTGC TTTACTGATC AGCCAQGCAA 900 
TAGCCATCCA AGAGCTAGAG CATCAAACAG GGCCCTTTCC AflGTAQGCTC TQGGTGTCCT 960

^ AAGCCAGCGT GTOCCCTCTC GTTTAGTCAG TGTAATAGAG TCCCTGGCflC CTTTCTTTOC 1020 

AAATGAGGCT AACAGACCAG ACPGCAGCAA GTTMCAGAT TCCTCAATCA GATGCACTAG 1080 

GAGTGAGGAG CCCAGGGATG GAGGGGCTTTC CTGAAGTATT GCAGTTQGCT GTAGTAGCTG 1140 

10 AGTTCTTTTC CATGTTACCG AAACTGTAGC CAGTTACAGT TTACTCAGGA AAAOSGTAGA 1200 

TCAATTCAGC CATOGTAGTG CTOCnTOGCA GQGATTOGTA AOQGAGAGAA CTGCTCATCA 1260 

GCCAAAACTC AAGCCTTGCC TTTTAGGAGG CXACCAGCAG AGGGACTTGG TCCTCCTTGT 1320 

CTOGrCACTTG TGTACATGCC GGTGACCTGA GGACTCCACT CACACTQGOG AGCAAAAAGG 1380 

GAGCAGTGAT TCTCTTTTCT CTCOCCACCC CXTTGCCXmr GTTACCAACA OCAGTTTCCC 1440 

20 AGGGGGTACA TGAGTTTCTG AATTTTTAAA AAATGmTT GGTrrGCTTT TTCTQGQGAC 1500 

TGATAAGTGC TTTAAGCAAT GTCCATACCX: CX3TCAAGACT CCCAGCTTAG TCATTTTCTT 1560 

GTATTTTTCT GTTCACAGTA TTTGICTGTG TOCTTGTTTT GQCAGCTCAT TTTGGCTGTA 1620 

TTATATATTG AGTGATGAAT TGATCCTCTT TTTTCCCTAA GGGATATGAA TTGTTTTTCT 1680 

TGTGTTATAT TCTQCTTGTG AATAGCTQGA GCAAACCTGG QGCTGACACG OGTAAGOTAG 1740 

30 GGCTGCAAAR CGAGAAGAGA GCCGGTGGAG TGTACTTGTC CCTGACAGGC TSACCTACCT 1800 

GAGTCTCTGA GCTTTTCAGT CXTJUITCTTT GCAAGGCTCA AAATGCCACA GAACCTCTCC 1860 

TCTTCTOCCC ACTCCCCATG GCAGGGACCG GACCATCXOT ACATGCAACA TQCTGTTCCT 1920 

CCAGCXXXrrC CCATTGCCAT GGCAAAACAG GTAOCTTTGG GGCATGGGGG CSmACATQG 1980 

GATGCTTGrPG TAATCGACCA CCTAGCXnTTC TCTCTCOCCT CCOGTCCTCC CCCAGAATCA 2040 

40 CTTCCTAGGA CACCCGAGCT GCTTGCCCAG GGTCCTGTTT CCXTIQCTAAC TCCAGAGAAG 2100 

CATCCTAGGG CTTTGrrGACA GTCTCTAATT CCCTTCCCTT CTCGTTAAGA ATCATATrGT 2160 

ATACTAGCTT TCAGACCATA CAGTATTCAT TQGGTTACTC CTATTATTAT CAAGTAGCTG 2220 

GAATTGTGAA GGTCGGAGTA GTTAGATCTT TAGCTTTTAT TCCTTATTTT TnCTATTAC 2280 

TCrcCATGTG TATAAATTAT TGATCATOIT GCTQGCTTTT ATAAACTCTA AGCGAAGGftG 2340 

50 GAGCACTGCC TCAGCCTTTG CACATGGTAA TGAAGCACTG TTTTTAAATA AAAGRGRGAA 2400 

HCHCCAAAAA AAAAAA 2416 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

GAATTCGGCA CGAQCCATGC YTQGCCTCTC CTTGATTCTT ACAGTCACTT TGTrGGCTCT 60

TTCTGACTCA GCAGCTACCT GCATTGTOGC CAAAQGATGA OCTATTCCTT CTCAGGAGGG 120 

CAAAAATOTG GAATAGTGTC TGTOCATCCC TCTCCTCATG GGCTACCAOC TCTGCCACCG 180 

TQGTTAATCA GTTUWCAACCA GGAGAGAAGC TGCTQGAACT GACCTCTGGG AACTCCTTQG 240 

15 ATGGTTTGGT GCAGGAATGT AGTAGGCATA CACGTOGTTG OtSTGGATCTG GGCCCTCCTG 300 

ATGTGAGTAG AGAGC3TAAAA GGSCACCATC TCCTTGACCT YTGQGGAACT CATCCACAAA 360 

GAftGATGTTT CCAAGATGCT TCTGAAGATT GSCTAAAAAT AGCCGGmTC CACCCCOGTC 420 

AATGCATCCA TTCTAGAATG CTCCTTCACC AGGACCAGAG AACTGATTTA CAGAAGTGAC 480 

ATGAAAACAT TCCATCCCAG AATTTGCANT ACCTCAAATT NAATTTCTAC CTATTAAAAA 540 

NAAAA 545



(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1530 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

GGCTCTGCTG GGCATCATAC TTGTCACTGG GTAAACAGTT TGCCCACTTA CCGCAGATGA 60 

AGCTGCTTGC CAGGGCTCTC CGGCTCTGTG AGTTTGQGAG GCAGQCATCT TCCAGGAGGC 120 

TGCJTQGCTGG CCAGGGATGT GTGGGGCCCC GGCGAGQGTG CTGCGCTCCC GrrCCAGGTQG 180 

TTGGQCOCAG GGCTGATCTC CCACCCTGTG GAGCCTGCAT TACTGGAAGG ATCATGCGGC 240 

CAGATGATGC CAADGTOGCC GQCAATGrrCC ACGGGGGGAC CATCCTGAAG ATGATCGAGG 300 

AGGCAGGCGC CATCATCAGC ACCOGGCATT GCAACAGCCA GAACGGGGAG CGCTGTGrTQG 360 

CCGCCCTGGC TCGTGTOGAG CGCACCGACT TCCTGTCTCC CATGTGCATC QGTGAGGTGG 420 

CGCATGTCAG CGCGGAGATC ACCTACACCT CCAAGCACTC TGTQGAGGTG CAGGTCAAOG 480 

TGATGTCCGA AAACATCCTC ACAGGTrGCCA AAAAGCTGAC CAATAAGC3CC ACCCTGTGGT 540 

ATGTGCOCCT GTOGCTGAAG AATGTGGACA AGGTCCTCGA GGTGCCTCCT GTTGrTGTATT 600 

CCCGGCANGA GCAGGAGGAG GAGGGCCGGA AGCGGTATGA AGCCCAGAAG CTGGAGCGCA 660 
TGGAGACCAA GTGGAGGAAC GQGGACATCG TCCAGCXAGT OTPCAACCCA GAGCCGAACA 720

^ CTGTCAGCTA CAGCX3tf?PCC AGCTTGATCC ACCTGGTOGG GCCTTCAGAC TGCACCCTGC 780 

AaSGCTTTCT GCAOQGfiGGT GTCACCATOA AGCICATQGA TGAGGTOGCX: GGGATCGrTGG 840 

CTOCftOQCCA CTGCAAGACC AACATCGTCA CAGCTTCCGT QGAOGCCATT AATTTTCATC 900 

0 ACAAGATCAG AAAAGQCTQC GPCATCACCA TTTCGOGACG CTVrGACCTTC ACGAGCAATA 960 

AGTCCATOGA GftrTCGAGGTG OTOGTOGACG CXX3ACCCTGT TGTGQACAGC TCTCAGAAGC 1020 

^ GCTACCGGGC CGCCAGTQCC TTCTTCACCT ACGrrGTOQCT GAGCX»GGAA QC3CAGGTCGC 1080 

TQCCTGrrGCX: CCAGCTGGTG CCCGAGACCX; AGGACGAGAA GAAGOGCTTT GAGGAAGGCA 1140 

AAGGGCGGTA CCTQCAGATG AAGGCGAACC GACAGGGCCA CX5CGGAGCXT CAGCCCTAGA 1200 

0 CTCCCTCCTC CTCCCACTOG TGCCTCGftGT AGCCATQGCA ACXSQCXXXIAG TOTCCACTCA 1260 

CTTAGAAGTT COCCCCTTGG CCAAAAACCC AATTCACATT GAGflGCTGC3T GTTCrPCTCAA 1320 

^ GTmCGTAT Cy0U3TGTTA ACCTGTACTC TCTOCTGCAA ACCTACACAC CAAAGCTTTA 1380 

TTTATATCAT TCCftSTAlXrA ATGCTACACA GnxnTCTCCC GAGCGCCGGG AGGCGTTCGG 1440 

CAGAAACCCT CQC3C5AATGCT TCCGAGCACX5 CTGTAGGGTA TQGGAAGAAC CCAGCACCAC 1500 

* TMATAAftGCT GWrGCTTQQC TGGQGAAOIA 1530 

(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 997 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

AGGTAGTQAG AGACACATTA CACCTAACCA ACAAGAAGAA GGATCCTCCC CCTTATAATT 60 

TAACTATGTT TACAGGGAAT GCGTACATTG TGGCTTCCCG AGNATTTCGT (XAACATGTT 120 

TTGAAGAACC CTAAATCCCA ACAACTGATT GAATQGGTAA AAGACAdTA TAGCCCAGAT 180

GAACACCTCT GGGCCACCCT TCAGCGTGCA CGGrTOGATGC CTGGCTCTGT TCCCAACCAC 240 

CCCAAGTACG ACATCTPCAG ACATGACTTC TATTGCCAGG CTQGTCAAGT GGCAGGGTCA 300 

TGAGGGfiGAC ATCGATAAGG GTGCTCCTTA TGCTCCCTGC TCTGGEAATCC ACCAGOGGGC 360 

TATCTGCGTT TATGGQGCTG GQGACTTGAA TTQGATGCTT CAAAAOCATC ACCTGrTTQGC 420 

CAACAAGTTT GACOCAAAGG TAGATGATAA TC3CTCTTCAG TCSCTTAGAAG AATACCTACG 480 
TTATAAGQCC ATCTATQQGA CTGAACTTTG AGACACflCTA TGAGAGOGTT GCTACCTCTC 540

GGGCAAGAGC ATGTACAAAC ATGCTCAGAA CTTGCTQGGA CAGTGTGC3GT GGGAGACCAG 600 

C3QCTTTGCAA TTCCJTOGCAT CCTTTAGGAT AAGAGGGCTG HrATTAGATT GTGGGrrAAC?r 660 

AGATCTTTTG CCTTGCAAAT TQCTC3CCTQG GnX2WVTGCTG CTmf n VlV r CACOCCTAAC 720 

CCTAGTAGTT CCTCCACTAA CTTTCTCACT AAGTOAGAAT GAGAACTQCT GTGATAGGC3A 780 

GAGTPGAAGGA GQC3ATATOK; GTAGAGCACT TGATTTCAGT TGAATCCCTG CTOOTAGCTT 840 

TTCCATTCTG TGC5AGCTQCC GTTOCTAATA ATTOCAGGTT TOGrTAGCGTG GAQGAGAACT 900 

15 TTGATOGAAA GAGAACCTTC CCTTCTGTAC TGriTAACTTA AAAATAAATA GCTCCTCATT 960 

CAAAGTAAGG AAAAARAAAA AAAGAAAAAA AACTOGA 997 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2345 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

TTGAGGCCGA CGCTAGGGGC CCGGAACSIAA ACTQCGAGGC GAAQGTGACC GGGGACOGAG 60 

CATTTCAGAT CTQCTCGGTA GACCTOGTGC ACCACCACCA TGTTGGCTGC AAGGCrGGTC 120 

TGTCTCOGGA CACTACCTTC TAGGGTTTTC CACCCAGCTT TCACCAAGGC CTCCCCTGTT 180 

GTGAAGAATT CCATCAOGAA GAATCAATGG CTGTTAACAC CTAGCAGQGA ATATGCCACC 240 

40 AAAACAAGAA TTGQGATCCG GOGTGGGAGA ACTQGCCAAG AACTCAAAGA GGCAGCATTG 300 

GAACCATCGA TGGAAAAAAT ATTTAAAATT GATCAGATGG GAAGATQGTT TGTTGCTQGA 360 

GGGGCTGCTG TTGGTCTTGG AGCATTOPGC TACTATGGCT TGGGACTGTC TAATGAGATT 420 

GGAGCTATTG AAAAGGCTGT AAITTTGGCCT CAGTATGTCA AGGATAGAAT TCATTCCACC 480 

TATATCTTACT TAGCAGGGAG TATTGGTTTA ACAGCTITGT CTGCXATAGC AATCAGCAGA 540 

AOGCCTGTTC TCATGAACTT CATGATGAGA GGCTCTTGGG TGACAATTGG TGTGACCTTT 600 

GCAGCCATGG TTQGAGCTGG AATQCTQGTA CGATCAATAC CATATGACCA GAGCCCAGGC 660 

CCAAAGCATC TTGCTTGGTT GCTACATTCT QGTGTGATGG GrTGCAGTOGT GGCTCCTCTG 720 

ACAATATTAG GGGGTCCTCT TCTCATCAGA GCTGCATQGT ACACAGCTGG CATTGTGGGA 780 

GOCCTCTCCA CTGTGQCCAT GTGTGOGCCC AGTGAAAAGT TTCTGAACAT GGGTOCACCC 840 

CTGGGAGTGG GCCTQGGTCT CGrCTTTCTG TCCTCAITGG GATCTATGTT TCTTCCACCT 900 
AOCACOGTPGG CTGGntXXyiC TCTTTACTCA GTQGCAATC3T ACGGTQGATT AGTTCTmC 960

AGCATGTTCC TTCTGrTATGA TACCCAGAAA GTAATCAAGC GTGCAGAAGT ATCACCAATG 1020 

TATGGAGrrC AAAAATATGA TCTCATTAflC TCGATGCTGA GrEATCTACAT QQATACflTEA 1080 

AATATATTTA TGCGAGrTTGC AACTATGCTG QCAACTGGAG GCAACAGAAA GAAA3X5AAGT 1140 

GACTCAGCTT CTGGCTTCTC TGCTACATCA AATAIdTGT TTAATOGQGC AGATATOCAT 1200

TAAATAGTTT GTACAAGCAG CTTTOGrTTGA AGTTTAGAAG ATAftGAAACA TGTCATCATA 1260 

TTTAAATOTT COGGTAATGT GATQCCTCAG GTCTQCCTTT TTTTCTGGAG AATAAAaXXIA 1320 

GTAATCCTCT COCAAATAAG CACACACATT TTCAATTCTC ATOTTTGAGT GATTTTAAAA 1380 

TGTTTrGGTG AATOTGAAAA CTAAAGTTTG TGTCATGAGA ATGrTAAGTCT TTTTTCTACT 1440 

TTAAAATTTA GTAGGTTCAC TGAGTAACTA AAATTTAGCA AACCTGrTGTT TGCATATTTT 1500

TTTGGAGrTGC AGAATAITCT AATTAATGTC ATAAGrPGATT TGGAGCTTTG GTAAAGGGAC 1560 

CAGAQAGAAG GAGTCACCTG CAGTCTTTTG TTTTTTTAAA TACTTAGAAC TTAGCACTTG 1620 

TGriTATTGAT TAGTGAOGAG CCAGTAAGAA ACATCTOGGT ATTTGGAAAC AAGTrOGTCAT 1680 

TGTTACATTC ATCTCCTGAA CTTAACAAAA CTGTTCATCC TGAAACAGGC ACAGGTGATG 1740 

CATTCTCCTG CTGTTGCITC TCAGTCCTCT CTTTCCAATA TAGATGrTOGT CATGTTTGAC 1800

TTGTACAGAA TOTTAATCAT ACAGAGAATC CTTGATGGAA TTATATATGT GTGTTTTACT 1860 

TTTGAATGrrr ACAAAAGGAA ATAACTTTAA AACTATTCTC AAGAGAAAAT ATTCAAAGCA 1920 

TGAAATATGT TGCTTTTTCC AGAATACAAA CAGTATACTC ATGATTGCTA AGTGrTTTTT 1980 

TATTTTTGCA TATTTATTGA ACTGTCTAAT TGAATACAGC TTGCTCTTGT CACCTCTTCA 2040 

AQCTTTCAAG CCTTTATAGA AAAGCTTCTT TGTQQCTTAC ACTQGAAATT ATGAAAGCAG 2100

TTTTTCTCCT AAGACTTTTG CnTTCTOGCTV TTCCXTPCTCA GACTAAGCAC TAAAAAGCAA 2160 

AGCAAAACAG AACTAGTTKrr GTCTTAATGA AATATAlt^^ CCCAAAACTC TAATGAGQAA 2220 

AATGCTTCAT TAGTTTCCCC TAGCAGACTT TTACTTCTCT TACACTGCTA CACCATTACT 2280 

TTCTTGAGAC ATTTGTAAGT CCTTTGATAC AGAAGAGTTA TATTTAOGAG GNCTTTAATG 2340 

AAOOG 2345



(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2369 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

TTTTrrTTTT TmrrmT TTTTTNCAAG ATCATTCTTT MTTATTACT TCAGATAAAA 60 

AGATAGTATA CATAUTAGGG AATCCCTTAA AATTCAACTC TAGAOTEATA CACCATCTAG 120 

TACmroCA ATGAATGTTA ACAACAACAA AAAAAATCTC TAAACACCTG AAAGCCOCAC 180 

TATTAACATG GACTATGGTA ATAAAAAATT TTGACATTTA ATTTGTTCAA CATATPAGTAT 240 

TTACATTATG AAACCAArTGG TGATGATACA ATAAAGTTGAT AAAGAAATAG TAAAAATAAA 300 

CTTTAAAAAG CAAAQGnTA TAGTCTGACA ATQCTAATTA TCCTAATTGT ATATAAAAAA 360

TTAAAACATA GAGCTTTCTG TTACAAAATT CTTAATOCTC TGGGTTGTAA TCATTACTTG 420 

CTACCAATTT ACATGCAACA TCTGCTAGGA CTGACATTTG AITTTTTTCC CCAAGAATGT 480 

GnGAGTAGAT AAATGACATT TCAGAGCAGA TATTAATOTA CTTGTQGACA GAAAAAGAAA 540 

CTCAAGATTG GTACTQGTCA CAAGCCTCTT CXX:AATASAA ATTATAAAAA CAGTAAGATA 600 

AAATTTAAAA AAAATCTAAA AAGGQGATGC ATAGGCAAAG AGTACCATAA ATQGCACAGC 660

TCAAAAAATC CCAGGACCAA TCAGACACAC ATCTTTTCTC TCTCCTTCAG OGACAAGAGG 720 

TCGATrrTGC CATCAAATAA CCATCATTGA AQCAAGCX5AG GQGCACCAGG TGTACAACTG 780 

A1TAGATCTT GCAAAATACT AAGATGGGAG CAGGGGTGGC CAGAAGAAGG GGTAATTTAT 840 

ATATAATTCA AACTATATAC AGCATAAATG GAATGCflGCC CATCCCAAAC TQGCTCTGPG 900 

AAACAATTGG ACCTTTATAG TTAAAATTAT AACAAGTGTA ATAATACAAT AGATTTACAT 960

GGGAAGCAAA ATCCAAGGGA CATTTTATAT TAAGTATTTA CTGTGCTGrrT TCAATTTAAA 1020

AATAATTTTG CTAAGTATAC ATCTCAACTG AAGTCTATGT AAAAAATGTC CTAATAGATA 1080 

CAGATATTTA CXirTTQGTGA GTTGAAGGCC TTTTTGrTGAC TTCTCTCTCA ACTCrTAGGCA 1140 

GAATGCTAGA TGTACAIGCA CATATGGAGA AACTCAAGCT GAGGTCATCC AAAAGCTGTG 1200 

CGTATGAGGA QGCTGGAGGT ACTTTGAAAG TCAAAGrTAGA CCAGAAACCC AAAACAGGTA 1260

ACAGTGAGGA TC3QCAACAGG GAATQGAATG CCAATATGGC AGTAAAACTT TTTTTAAAAA 1320 

CAGAAAGAQG AAGGCCTCTC GTACCAGCAG AATCCTGTAC ACGTACAAAA AAGAAAAAGC 1380 

CACCCACCAT TITGTAAAAC AGAAGCCAAT TATAGTXnXSG GAAAGTACAA ATTACAGAAA 1440 

ACCAGAAGTC AACAGAAGAA AAACTACTGG TTTACTTGAG AGAAAGGAGA ATGGTTCACC 1500 

CXXSAGCAGAG TTACTTGGTG AACGCCGCCA CXaUXGOCCA CAGAACCTCA TTGGrrGTTGG 1560

CCTTCAGACA TTCXACTTCA GGGTCTAAGT OGAGAARNTG CXX3CACTCTC TTGGTAGCCA 1620 

AATCATACTG CTCX3TCCAGA AGAGGAGCAA AAGCATTCTC CAGGACGICC GAGGCATGAG 1680

CXMGTAAAT GAGQGCCAGC AAGOGCCTGT CCAIXSOXTTG AQGCTTCATTC ACCCATTTGT 1740

CAAGAAOQGC TTCCTGTACT TTCTTGATGA GGCGCTGCTT AATGTTGTTA TTGGTGaGGG 1800 

GAIOTGrrrGT CATGTCAAAA AGTAGGAAGT TCTGTTTCTC TGTTCTCAAT ACACCCTTTT 1860

ccAccAGGrrr tttagctaat cxtttcccxsta C3vrrrcTTAA crcATAAra: aattttaatg 1920 

GATTCCATGT CPCACCACTA AGTAATTCAA TOCAGTTCTG GACCGTrTCT QGftGQCTOAG 1980 

TTTCCTTAAC ATOCTTTCAGA GCTTCATCAA GAAGAACATC CCCTGTTGGA GCATCTGACT 2040 

TACAGATTAC CTTTCTTOTT AATAGACTTT TACGTCTCAT TCCACAAGCC TCTAGTTt3TA 2100 

ACCTTCCTCT CAATQCTAAT TCAATTAACA TACAGCCACG TAATCCflGAT GATATACAGT 2160

CATTCCAAAA TGATGTGrTAA ACCTTCGOGG TCCITGAGGC CCAGCAGGAG CACTTCCTCC 2220 

ATCAGC3GTCA GCCXXX3TTTC CTTOGAGrTCG CCCTTGTCGT OGTCGrTCCTG CTCGTOGCGG 2280 

OGGCTCTGCG CGTCGTCCTC GCTGCTAGCC GCGCCGCCGC CCGCCGCCCG CilXTll^'iUi 2340 

C50GGCGrrGC GQGAGGCCTC GGrTQCGCOG 2369 



(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

GGGACGTOTG GTTTCAGCTC GTOCGCCTCC CCGTGGGTTT GOGACGTrTA GCGACTAaTC 60 

CGCCTOCGCC ACGCCGGCTG CGAGACTGGG GCCGTGGYTG CTOGTCCCGG GTGATGCTAG 120 

GOGGCTCCCT GQGCTCCAGG CTGTTGCGGG GrTGTAGGTGG GAGTCAOGGA CGGTTCGGGG 180 

CCCGAGGTGT COGCGAAGGT GGCGCACATG GGCGGCAGGG GAGAGCATGG CTCAGCGGAT 240 

GGTCTOGGTG GACCTT3GAGA TGACAGGATT OGACATTGAG AAGGACCAGA TTATTGAGAT 300 

GGCCTGTCTG ATAACTGACT CTGATCTCAA CATTTTGGCT GAAGGTrCCTA ACCTGATTAT 360 

AAAACAACCA GATGAGTTQC TQGACAGCAT GTCAGATTGG TGTAAOGAGC ATCACGGGAA 420 

GTCTGGCCTT ACCAAGGCAG TGAAGGAGAG TACAATTACA TTGCAGCAGG CAGAGTATGA 480 

ATTTCTGTCC TTTGTAOGAC AGCAGACTCC TCCAGGGCTC TGTCCACTTG CAGGAAATTC 540 

AGTTCATGAA GATAAGAAGT TTCTTGACAA ATACATGCCC CAGrPTCAITCA AACATdTCA 600 

TTATAGRATA ATTGATGTGA GCACTOTTAA AGAACTGTGC AGACGCTQGT ATCCAGAAGA 660 

ATATX3AATTT GCACCAAAGA AGGCTGCTTC TCATAGGGCA CTTGATGACA TTAGTGAAAG 720

CATCAAAGAG CTTCAGTriT AOXSAAATAA COTCTTCAAG AAAAAAATAG ATGAAAAGAA 780

GAGGAAAATT ATAGAAAATG GQGAAAATGA GAAGACOGTG AGTTGAfPGCX: AGTTATCATG 840

CTGCCACTAC ATOGTTATCT GGAGGCAACT TCTGGTOGTT TriTmVIi: ACGCTGAfPGG 900 

CTTGGCAGAG CMCTTCGGTr AACTTGCATC TCCAGATTGA TTACTCAAGC AGACAGCACA 960 

CGAAATACTA TTTTTCTCCT AATATGCTOr TTCCATTATO ACACAGCAGC TCCTTTGTAA 1020

GTACCAGGTC ATGTCCATCC CTTGGTACAT ATATGCATTT GCTTTTAAAC CATTTCTTTT 1080 

GTITAAATAA ATAAATAAGT AAATAAAGCT AGTTCTATTG AAATCCAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA N 1181 






(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

CTTTTTGTGT TCCGGCCGAT CCCACCTCTC CPCGACCCTG GACGTCTACC TTCOGGAGGC 60 

CCACATCTTG CCCACTOCGC GOGOGGGGCT AGOGCGGGTT TCAGOGACGG GAGOCCTCAA 120 

GGGACATGGC AACTACAGCG GCGOCGGOGG GCGGCGCCCG AANATGGAGC TQGCCOCSGAA 180

TC5GGGAGGGT TCGAAGAAAA CATCCAQQGC GGAGGCTCAG CTGTGATTGA CATGGAGAAC 240 

ATOGTVPGATA CCTCAGGCTC TAGCTTOGAG GATATQQGTG AGCTQCATCA GOGCCTGOGC 300 

GAGGAAGAAG TAGACGCTGA TGCAGCTGAT GCAGCTGCTG CTGAAGAGGA GGATGGAGAG 360 

TrCCTGGGCA TGAAGGGCTT TAAGGGACAG CTGAGCOGGC AGGTC3GCAGA TCAGATGTTGG 420 

CAGGCTOGGA AAAGACAAGC CTCCAGGGCC TTCAGCTPGT ACGCCAACAT OGACATCCTC 480

AGACCCTACT TTGATGTGGA GCCTQCTCAG GTTGCGAACAG GGCTCCTGGA GTCCATGATC 540 

CCTATCAAGA TQGTCAACTT CCCCCAGAAA ATTGCAGGTG AACTCTATGG ACCTCTCATG 600 

CTGGTCTTCA CTCTGGTTGC TATCCTACTC CATGGGATGA AGAOGTCTGA CACTATTATC 660 

CGGGAGGGCA CCCTGATGGG CACAGCCATT GGCACCTGCT TCGGCTACTG GCTGGGAGTC 720 

TCATOCTTCA TTTACTTCCT TGCCTACCTG TGCAACQCCC AGATCACCAT GCTQCAGATG 780

TTGGCACTQC TQGGCTATGG CCTCnTGGG CATTGCATTG TCCTGTTCAT CACCTATAAT 840 

ATCCACCTCC AOGCCCTCTT CTACCTCTTC TGGCTGTTQG TOGGTOGACT GTCCACACTG 900 

CGCATOGTAG CAGrTGrTTOGT GTCTCGGACC GTGGGCCCCA CACAGCGGCT GCTOCTCTGT 960

QGCACCCTGG CTGCCCTACA CMOTCTTC CTGCTCTATC TXXAmTGC CTACCACAAA 1020 

GTOJTAGAGG GGATCCTGGA CACACPQGAG GGCCCCAACA TCCOQCCCAT CXAGAGGGTC 1080 

CCCAGAGACA TCCCTGCTAT GCTCCCTGCT GCTOGGCTTC CTACCACCGT OCTCAAOXX: 1140 

ACAGCXMAG CTGTTQCGGT GACOCTOCAG TCACACTCAC CXXZACCTGAA ATTCTTQC3CC 1200 

AGTCCTCTTT CCCXX:AGCTG CAGAGAGGAG GAAGACTATT AAAGC5ACAGT CXTPGATCSACA 1260 

TGTrTCCTAG ATGGGGrTTG CAGCTGCCAC TGAGCTGTAG CTGCGTAAGT ACCTCCTra* 1320 

AGCTGTOGGC ACTTCTGAAA GCACAAGGCC AAGAACTCCT GGCX:AGGACT GCAAGGCTCT 1380

GCMXCMiTG CAGAAAA3t3G CSTCAGCTCCT TTGAGAACCC CPOCCCACXT ACCCCTTCCT 1440 

TCCTCTTTAT CTCTCCXACA ITCTCTTGCT AAATATAGAC TTQGTAATTA AAAAAAAAAA 1500 

AAAAAAAAAA AAAAAAAAAA AAAAAAGGQG GCa«CCCC 1537 




(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1493 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:

TQCATQCCAA AACCAATQCC TGCCAAACAA AATCTTAGAC ATCCCAATAT AATATGTTAG 60 

TTATATTTCT ATTCACATCA TTATTGAAAA TACOCAGCTC AGrTGCCTCSGC TTAATAAATG 120 

TTTAATTCCC TTACCTACTC TTGCTCTATT TmTATTTG AAATQGAGAT GAGCAAAATA 180 

ACACATTCAT GGCTGAAGCA ATTTTTTGGA CATTTCTTGT TACCAAAAGA TCTATAATCA 240 

QGATGATCCT GAGCTGTTCA AACAAGCTGT ATATAAACAG ACAATGAAAC TCTTTQCAGA 300 

GCTGGAAATT AAAAGGAAAG AGAGAGAAGC CAAAGAGATG CATGAAAGGA AACGACAAAG 360 

GGAAGAAGAG ATTGAAGCTC AAGAAAAAGC CAAACGQGAA AGAGAGTGGC AGAAAAACTT 420 

TGAGGAAAGT CGAGATGGTC GTGTQGACAG CTGGCGAAAC TTCCAAGCCA ATACGAAGGG 480 

GAAGAAAGAG AAGAAAAATC GGACCTTCCT 6AGACCACCG AAAGTAAAAA TGGAGCAACG 540 

TGAGTQACOG CCCAAQGTCA CAGQCACAGA ACCTTTCCCC TGCTATCTCC CTTCCTGCTT 600 

CGAAGGACTC ATTCTTTCCT CCCACTTCCA CCCCAACATA GAGTAGTATT TGCTTTTTAG 660 

TCCATTTTGT TTTCAATACG ATTTAATATC GATCAGAGTA ATTCnTTGT ACATTGAAAT 720 

GAGGGGCTTG GmTAAAAAA AGACCTTTCC CTCTCCCTGC OCTTAGAACA ACCAGTA3TA 780

GAAOGTOCCA CCATTOGrTGC TOCCTTCTCT TCCCACAGCC TGTAACTCAG TOmTOTAC 840

TTCACTGAAT TGrTGATGGTT AGAAACTTCG TGGATAGTTT (7PGGAAATCA aCCAATEAAA 900

cATAcrocrr aaaacagtot toctgtgact tcagagacaa gcctggaagg ggcaccttag 960

GAAQCXXXrrr CGCTrCAGTT GCTOGCTTCT GCSGTCTGCTC CTTTCGAAGG (XCAGATAAG 1020

ACAGGGAACA CTTGTGAGCA CACAGAGCAG CMCTGATQC CCTCK5GTGT TTOGCATGTC 1080
CCCCCTGTCT ACTGACCAAT CAGTGTQGCA TGAGGCCCAC QCXACCCAAA CmTCACTT 1140 
TCCAAAGAGC TAGCCGTCCT CCACCCAGTA OCATGTCCTA GCCTCPCTOC ATTTCITAGr 1200 
GGTAATATTC TTTATC5TATA ATAAATTTTT ATACCCAAGC CATTGATOEA CTmi,VritJ 1260 
TACTCTOCXT TGTGGGTCCC TTGTCTGGCT TGGCTGAACC CCAAAATCCT TTOGGC7ITOG 1320 
ACAGACCTGG CTGAACCTTA GTTTCTTCAT CTATGAAATG GGAATAIT3AA TTACTGCAGC 1380
AQCITITAGG GCAGATTTGC CATGGCATAT ACAAQGTAAC TACCATAGTC CTCCTrGGCyT 1440 
ATTGCCAATA TCXTATTATT TCTGTOTAAA ATCAAGATAC TGATTCTTTT GAG 1493 






(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 577 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

AATTOGQCAG AGOIATTATA TACACTATAC TGGCATTTAC TGTTTCACCC AGCCCGGAAA 60 

GTCAGAGATG TATA3TGGAA AATTTACAAC TCCATCTACA TrGGTTCCCA GGACGCTCTC 120 

ATAGCACATT ACCCAAGAAT CTACAACGAT GATAAGAACA CCTATATTCG TTATGAACTT 180 

GACTATATCT TATAAriTTA TTGTrTATrr TGTGTTTAAT GCACAGCTAC TTCACACCTT 240

AAACTTGCTT TGATTTGGTG ATGTAAACTT TTAAACATTG CAGATCftGTG TAGAACTCCT 300 

CATAGAQGAA GAGCTAGAAA TCCAGTAGCA TGATTTTTAA ATAACCTOTC UTlUXTmu 360 

ATGTTAAACA GTAAATGCCA GTAGTGACCA AGAACACAGT GATTATATAC ACTATACTOG 420 

AQQGATTTCA TTTrrAATTC ATCTTTATGA AGATTTAGAA CTCATTCCTT GTCrTTTAAAG 480 

GGAATGTTTA ATTGAGAAAT AAACATTTGT GWACAAAATG YTAAAAAAAA AAAAAAAAAA 540

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACTCGA 577

(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2860 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

CSTGmJSACCG CTCTCNCAAT ATGC3CTCCCC CGQQCTGQCA (3?MRieiO?GT CWCKRC3TCGC 60 

TAGCCICTCC TGACAGGGGA GAGTTAAGCT CCCGTTCTCC ACajTOCCGG CTOGCCflGGT 120 

QGCSCTGfiGGG TGACCGAGAG AOCAGAACCT GCTTQCTCGA GCTTAGTCCT CAGAGCTCQG 180 

CSflGGGAGGTT CCGCC3C3CTCC TCTOCTGTCA GCGCCGGCAG CCCCTCCCQG CTTCACTPCC 240 

TCCOGCAGCC CCTGCTACTG AGAAGCTCCG GGATCCCRGC AGCCGCCACG CCCTCGCCTC 300

AGCCTGOC3GG GCTTCCAGTC AGGCCAACAC OGAOGSCGCAC TC3GGGAQGAA GACAGGACCC 360 

TTGACATCTC CATCTOCACA GAGGTCCPC3G CTGGAACCGA GCAGCCTCCT CCTCCTAGGA 420 

TGACCTCACC CTCCAGCTCT CCACJITITCA GGTTOGAGAC ATTAGATCGA GGCCAAGAAG 480 

ATCSGCTCTGA GX3CGGACAGA GGAAAGCTGG ATmOGQAG CGGCXTOCCT CCCATOGAOT 540 

CACAGTTCCA GGCXX3AGGAC CGGAAATTCG CCCCTTCAGA TAAGAGTCAA CCTCCAACTA 600

COGAAAGQGA ACAQGTGCCA GTCAGCOQGA TCCAAACCGA TTTC3ACCGAG ATCGGCTCTT 660 

CAATC3CGC3TC TCCCGGQGTG TCCCCGAGGA TCTGtSCTOGA CTTCCAGAGT ACCTCAGCAA 720 

GACCAGCAAG TACCTCACCG ACTTOGGAAA TACACAGAQG GCTCCACAGG TAAGACGGCC 780 

TC5ATGAAGGC TC3TC3CTGAAA CCTTAAGGAC GC3GGTCAATG CCTOCA3TCT GCCACTOCTC 840 

CAGATCGACC GQGACTCTC3G CAATCCTCAG CCCCTOOTAA AIXSCCCAGTC CACAGATCAC 900

TATTACOGAG GCCACAGCGC TCTGCACATC CXXZATTGAGA AAGAGGAGTC rCCNTIGFGT 960 

GAAGCTCCTC; GTOGAGAATG GQGCCAATGT C5CATQCCCGG GTCTCCGGCG ACnCTTCCA 1020 

GAAGGGCCAA GGGACTTOCT TrPA3TTCGG TGAGCTACCC GTCTCTrTOG CCGCTTOCAC 1080 

CAAGCAC?rGG GMXJTQCSTAA GCTACCTCCT GGAGAACCCA CACCAGCCCG CCAGCCTGCA 1140 

GGCCACTGAC TCCCAGGGCA ACACAGTCCT GCATOCCCTA GTOGATCATC TCGGACAACT 1200

CAGCTGAGAA CATTGCACTG GTGACCAGCA TGTATC5ATGG GCTCCTCCAA CXmOGGGSCC 1260 

SCCYTCTGCC CTACOGTC3CA GCrTGflGGAC ATCCGCAACC TGCAGGATCT CACGCCTCTC 1320 

AAGCTGOCCG CCAAGGAGGG CAAGATOGAG ATTITCAGGC ACATCCTCCA OCGGGACTTTr 1380 

TCAGGACTGA GCCACCTTTC COGAAAGTTC ACCGAGTC3GT GCTATCGGCC TOTCCGQGTC 1440 

TCGCTGTATG ACCTQGCTTC TGTGGACAGC TGTCAGGAGA ACTCAGTCCT GGAGATCATT 1500

GCCTTTCATT CSCAAGAGCCC GCACOSACAC OSAATOGTCG TPITOC3AGCC CXHXSAACAAA 1560

CTCCTCCAGG CGAAATQGGA TCTGCTCATC CCCAAGrTPCT TCTTAAACTT CCTCTOTAAT 1620 

CTGATCTACA TGTTCATCIT CftCOCSCICTT OXTACCATC AGCCTACCCT GftAGAAGCAG 1680 

GCXDGCCOCTC ACCTCAAAGC GGAGGTTQGA AACTCCATCC TGCTGA00C3G CCACftTCCIT 1740 

ATCCTGCTAG GGC3QGATCTA CCTOCTOGTG GCC3CCAGCTG TGGrACTTCT GGCGGCGCCA 1800 

Ojiv, riCATC TOGATCTOGT TCATAC3ACAG CTACTTTQGA AATCCTCTTC CroiTCCAGG 1860 

CCCTGCTPPCA CAGTQCnCTC CCflOTrCSCTO TO rril,Vil« GCCA3CGAOT GCTACCTOCC 1920 

CCTGCTTOTG TCTOCGCTQG TGGCTOt3GCT GGCTGAflCCT GCTTTACTAA TACACOTOGC 1980 

C3TTCCAGCAC ACAGGCAGTC TACAGTTrCA TGWTCCCTGA AGCCCTOCTG AGCXTOAGCC 2040 

AGGAGGCTTG C3CGCCOCGAA GCTCCTACAG GCCCCAATCC CACAGAGTCA CTTGCAGCCCA 2100 

'K3GAQQGACA GOAGGACXSAG GGCAACGGGG CXXrAGTACAG GGGTATCCTC GAAGCCTCCT 2160 

raSAGCTCTT CAAATTCACC ATOGGCATGG QCGfiGCTQC3C CTTCCAQGAG CAGCTCCACT 2220 

TCCGCGQCAT GCTTGCTGCTG CTOCTGCrGG CCTACGTOCT CXTCACCTAC ATCCIGCTOC 2280 

TCAACATC3CT CATCGCCCTC ATCAAGOGAA CCJrCACAGTG TCGCCACTCA CAGCTCGAGC 2340 

ATCTGGAAGC TQCftGAAAGC CATOTCTCTC CTOQAGATGG AGAATCGCTA TK3CTGGK3C 2400 

AGGAAAAADC AGCQGGCftGG TOTGATGCTG ACCGITOGCA CTAAC3CCCAG ATCGCAGCCC 2460 

CGATGAGCCX: TCX3TGCTTCA aXTTOGAGGA GGTGAACTGG C3CTTCATCGG GAGCAGAOSC 2520 

TQCCTACGCT GrPGTPGAGGAC CXX3TCAGC3GG CM3GTGTCCC TOGAACTCTC GAGAACCCTC 2580 

TCCTGGCrrC CCCrCCCAflG GAQGATCSAGG ATGGTGCCTC TGAGGAAAAC TATOTCCCCG 2640 

TCCAGCTCCT CXAGrTCCAAC TCATCGOCCA GATGCAGCAG GAGGCCAGAG C3ACAGAGCAG 2700 

AC5GATCTTrC CAACCACATC TQCTOGCTCT GGGCTTCCCAG TGAATTCTQG TCGCAAATAT 2760 

ATATTTTCAC TAACTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAVGAGQG GQGGCCOGKT 2820 

ASCCAAWTTC GCCCTATAAG TGAGTOCCWA TTACGATAAA 2860 



(2) INFORMATION FOR SEQ ID NO: 308:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

CrGGTICTCST CroOGCTGGT GCTOGGCTQG CTGAACCTOC TTTACTATAC ACCnGGClTC 60

CAGCACACAG GCATCTACAG TCTCATGATC CAGAAfiCCCT GtSTCAGCCIG AGCCA£3GANN 120

TTQGCGCCCC GAAGCTOCTA CAGCXXXXZAA TGCCACAGAG TCAGKXAGC CCATCQAGOG 180

^tf^AGGAGGAC GAGGGCAACG GC3G0CCACTA CAGGQC3TATC CTOGAAGCCT CCITOGAGCT 240

CTTCAAATTC AOCATCX3C3CA TGC3G0GAGCT GGCCTTCCAG GAGCAGCTOC ACTTCCGCGG 300

CATCXHOTX; COtSCTOCKX: TCX^ .

GCTCATOCSCC CTCATCa^AGC GAGACOGWCA ACAGTGTCGC CACTCACAOC TOGAGCATOT 420

C3GAAGCTOCA GAAAQCCATC TCTGTCCTOG AGATGGAGAA TGGCTAITOG OGCSTCCAGGA 480

AGAAGCAGCG GGCAGGrTGTC ATGCTGACCG TO3CXACTAA OCCAGATCGC AGCCCOGATC 540

AGCGCTGGTG CITCAGC3GTG GAGGAGCTGA ACTC3GC3Cm: ATCGGAGCAG ACGCKSCCTA 600

OCSCTGnOTGA GGACCCGTCA GGGGCAGGTG TCCCTCGAAC TCraS^USAAC CCTOTCCTCG 660

crrcoccrcc caaggaggat gaggatggtg octctgagga aaactatoig cccc?rccAGc 720

TCCTCCAGTC CAACTGATGG CCCAGATGCA GCAC3GAGGCC AGAGGACAGA GCAGAGGATC 780

TTTCCAACCA CATCTGCTCG CTCTGGGGTC CCAGTGAATT CTOGKXXrAA ATATATATTT 840

TCACTAAMWM AAAAAAAAAA AAAAAAAAAA ACTOGA 876

(2) INFORMATION FOR SEQ ID NO: 309:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2025 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:
CATGACCCGC CTGATGCGAT CCCGCACAGC CTCTGGTTCC AGCCTCACTT CTCTGGATOG 
CACCCGCAGC CGCTCCCACA CCAGCGAGGG CACCCGAAGC CGCTCCCACA CCAGCGAGGG 
CACCCGCAGC CGCTCGCACA CCAGCGAGGG GGCOCACCTC GACATCACCC CCAACTCGGG 
TOCTOCTGGG AACASGCCGG GCCCAAGTCC ATGGAQGTCT CCTOCTAGGC GGCCTCCCCA 
GCTQCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC CTCCCCGGCC CCITITOGCC 
CCCTGCCTGC CATACTGCGC CTAACTOGGT ATTAATCCAA AGCTTATnT CTAAGAGIGA 360 
CSCTCTQGTGG AGACAAAT6A GGTCTATTAC GTGGGTGCCC TCTCCAAAGG CGGGGTOGCG 420 
OT^GACCAAA GGAAG6AAGC AAGCATCTCC GCATOGCATC CTCTTCCATr AACCAGTCGC 
CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT GCCAAAAACA AGAOGCCTAC
AGCACACACT TCACAAAGCC AflGCCTACSGC OSCCCTCAGC MtCTXXnTC AAAOGGGICC
CraTTCAGAA GGCCAGCC^ 

AGTrrCCXXSA AACAAAATCC ITITCTCArr TOGGGAGC3GG GGTAATAGTC ACATOCAGCSC 
ACCTCmTA AACAGC3CAAA ACAGQAAGGG GCiAAAAGCSro GGATTCAtXTT CGAGC3CTAGA 
GGCATITOGA ACAACAAATC TACGTAGTTA ACTTGAAGAA ACCGATTTIT AAAC3ITOGTC 
CATCTAGAAA GCTTIGAATG CAGAAGCAAA CAAC5C1TCAT TmCTAGCA TCCICPTAAT 
GTCCAGCAAA AGCAGGCRAC AAAA3CTCCT QCSCTTTACAG ACAAAAATAT TTCAGCAAAC 
GTTGGGCATC ATOGTmTC AAGGCTTTAG 'TrCTCCTITC TCCCTCTCCT CCACAGCCXX: 
AACCTCCCAC CCCTGATACA TGAGCCAGTG ATTAITCITC TTCAGGGAGA AGATCATITA 
GATTTGTrrT GCATTCCITA GAATQGAGGG CAACATTCCA CAGCTCCCCT C3GCTOIGATC 
AGTGTCCTO3 CAGC3GGCXX3G AGTAGGAGCA OSCGGAATTC GGC?ITACTCG 

A3CTAAGGGA TTCCTTGTTG ITCTCTTGAG ATCCAGTCCA CTTCrcATrr CTCTGGaTCC 
CAGCTTGGTT CCAOGAATTT TGTOrcATTG GCTTAAATCC AGnTTCAAT CTTOGACAGC 
TCGCSCTQGAA CGTGAACTCA OTAGCTGAAC CTC3TCTGACC aSGTCACXSTT CTTOGAICCT 
CflGAACTCTT TGCTCTICTC GQC3C?rC3C30GG TGGGAACTCA CCTX3GGGAGC GC3TOC3CIGAG 
AAAATC3TAAG GAITCTGGAA TACATATTCC ATC3GGACTTT CCTICCCTCT CCTOCrTCCT 
CTTTTCCTGC TOCTAACCT TTCGCCGAAT GGGGCAGCAC CACTCACC7IT TCTQGGCGCX: 
atf7rC30C5C3CT GCCAGGTTCC TGTACTACTG CXTTCTACTT TTCAIITTTOG CTCACCGTOG 
ATTTTCTCAT AGGAAGTTTG GTCAGAGrTGA ATPGAATATT GTAAGTCAGC CACTCGGACC 
OGAQGATITC TGGGACCCCX; CAGTTGOGAG GAGC5AAGTAG ICCAGCCTTC CA(3GTX3GCCT 
GAGAGC3CAAT GACTCGTTAC CTGCOGCXXyV TCAOCTTOGA GCSCCPTCCCT CSGCCTTCAGT 
AGAAAAGTCG GGGATCQGGG CAAGAGAGGC TGAGrTAOCSGA TOC5GAAACTA ITOTCCACAA 
CHCTITCCAG AGGAGTTTCT TAATGAGATA TPTGrATrTA TTTCCAGACC AATAAATTTG 
TAACTTTGCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAACTC 
GAGGC5GGGCC COTACCCAAT TCGCCGTATA TGATCGTAAA CAATC

(2) INFORMATION FOR SEQ ID NO: 310:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3026 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

TAGGCAGCAC TGAAATATCC TAACCCCCTA AGCTCCAGCTr GCCCIt3K3C3l ACGAGCAACT 60

GGACTATAGC AGGGCTGGGC TCTOlt^rilX: TGGTCATAGG CTCACTCTIT CCCCCAAATC 120

TTCCTCTOGA GCTTTGCAGC CAAGGTGCTA AAAGGAATAG CTAGGAGACC TCTICTAICT 180

AATCCTTAAA AGCATAATGT TGAACATTCA TTCAACAGCT GATCCCCTAT AACCCCTCCC 240

TGGATrTCTT CCTATTAGGC TATAAGAAGT AGCAAGAICT 1TACATAATT CAGAGIGGIT 300

TCATTGCCTT CCTACCCTCT CEAATGGCCC CTCGATTTAT TTGACTAAAG CATCACACAG 360

TGGCACTAGC ATTATACCAA GAGTATGAGA AATACAGTGC TTTAaXXSCTC TAACATTACT 420

GCCITCAGTA TCAAGGCTGC CTCGAGAAAG GATGGCflGCX: TCAGGGCTIC CTTATOTCCT 480

CCACCACAAG AGCTCCTTGA TGAAGGTCAT CTTTITCCCC TATCCICTTC TTCCCCTCCC 540

CGCTCCTAAT GGTAOGTQQG TACCCAGGCT GGTTCTTGGG CTAGGTAGTO GGGACCAACTT 600

TCATTACCTC CCTATCAGTT CTAGCATAGT AAACTAOGGT ACrAGTOTTA CTOQGAAGAG 660

CTGGGTTTTC CTAGTATACC CACTCCATCC TACTCCTACC TCGrTCAACCC GCTCCTTCCA 720

GGTATOQGAC CTGCTAAGTG TGGAATEACC TGATAAGGGA GAQGGAAATA CAAGGAGGGC 780

CTCTGGTGTT CCTQGCCTCA GCCAGCTGCX: CACAAGCCAT AAACCAATAA AACAAGAATA 840

CTGAGrrCAGT TrrTTATCTG GGTTCTCTTC ATTCCCACTG CACTTCCJPGC TCCTTTOGCT 900

GACTGGGAAC ACCCCATAAC TACAGAGTCT GACAGGAAGA CTOGAGACTC TCCACTTCTA 960

GCTOQGAACT TACTGTGTAA ATAAACTTTC AGAACTGCTA CCAIGAAC?ro AAAATGCCAC 1020

ATTTTGCTTT A'EAAITTCTA CCCATGTTGG GAAAAACTGG CTTTTTCCCA GCCCTTTCCA 1080

C3GGCATAAAA CTCAACCCCT TCGATAGCAA GTCCCATCAG CCTAITAnT TTTTAAAGAA 1140

AACTTOCACT TCTTTTrCTT TTTACAGriTA CTTCCTICCT GCCCCAAAAT TATAAACTCT 1200

AAGTOTAAAA AAAAGTCTTA ACAACAGCTT CTTGCTTGTA AAAATAItTTA TTATACAOCT 1260

GTAirrTTAA ATTCTGCTCC TGAAAAATGA CTGTCCCATT CTCCACTCAC TCCArrTOGG 1320

GccrrrcccA ttqgtctgca TGrrcrrTTAT cattocaggc cagtxsqacag agggagaagg 1380

GAGAACAGGG GTCGCCAACA CTTGTOTTGC ITTCTQACTG ATCCTGAACA AGAAAGACTA 1440

ACACTGAGGC GCTOGCTCCC ATCCACAACT CTCCAAAACA CTTATCCTCC OGCAAGACnx; 1500

CSGCTTTCCAG GGriCTTTACT GGGAAGCAGT TAAGCCCCCT OCTCACCCCT TCCirrPTrc 1560

TTTCTITACT CCTTrGGCTT CAAAGGATIT TQGAAAAGAA ACAATATCCT TTACACICAT 1620

TTTCAAinC TAAATTTGCA GGGGATACTG AAAAATACGG CAGGTCGCCT AAGGCTCCTG 1680



TAAAOTTGAG GGGAGAGGAA ATCTTAAGAT TACAAGATAA AAAACGAATC CCCTAAACAA 1740 

AAAGAACAAT AGAACTQGrPC TTCCATTTTG CXAOCTTTCC TGTTCATGAC AGCTACTAAC 1800 

CTOGAGACAG TAACATITCA TTAACCAAAG AAAGTOOtJPC ACCTGACCTC TCAASAGCTC 1860 

AGTACTCAGG CCACTCCAAT CACCCTACAA GATGCCAAGG AGGTCCCAGG AAGTCCAGCT 1920 

CXTTTAAACTG AOGCTAGNMA AOAAWXTOG GCAAGTGAGG CAAGAGAAAT GAGGAAGftAT 1980 

CCATCTC3TCA GGTCAXAQGC AAGGATGAAA GACAAAGAAG GAAAAGAGTA TCAAAQGCAG 2040 

AAAGGAGATC ATTTAGrrPOG GTCTGAAAGG AAAAGrPCTTT GCTATCOGAC ATGTACTOCT 2100 

AGTAOCrGTA AGCATTTTAG GTCCCAGAAT GGAAAAAAAA ATCAGCTATT GGTAATATAA 2160 

TAATGTCCTT TCrCTGGAGT CAGTTTrTTT AAAAAGTTAA CTCTTAGTTT TTACrTGTTr 2220 

AATICTAAAA GAGAAGGGAG CTGAGGOCAT TCCCTGTAGG AGTAAAGATA AAAGQATAQG 2280 

AAAAGATTCA AAGCTCTAAT AGAGTCACAG CTTTCCCAGG TATAAAACCT AAAATTAAGA 2340 

AGTACAATAA GCAGMSGTCG AAAATCATCT AGrTTCCTGAT AGCTACCCAC AGAGCAAGTC 2400 

ATTTATAAAT TTGAAATCCA AACTACTTTC TTAATATCAC TrTGGTCTCC AnTTTCCCA 2460 

GGACAGGAAA TATGTCCXCC CCTAACTTTC TTGCTTCAAA AATTAAAATC CAGCATCCCA 2520 

AGATCATPCT ACAAGTAATT TTGCACAGAC ATCTCCTCAC CCCAGrPGCCT CTTCTOGAGCT 2580 

CACCCAAGGT CANCCAAACA ACTTOGTTGT GAACCCAACT GCCTTAACCT TCTOGGGGAG 2640 

GGGGATTAGC TAGACTAGGA GACCCAGAAG TGAATGGGAA AGGC3TCAGGA CITCACAATC 2700 

TTOGCCrcrrC AGAGCTTGAT TAGAAGOCAA GACACTGGCA GCAAAGGAAG ACTTGGCCCA 2760 

GGAAAAACCT GTQGGTTGTG CEAAITTCTG TOCAGAAAAT AGGGrTGGACA GAAGCITOIG 2820 

GGGTGCATGG AGGAAOTGGG ACCTGGTTAT GTTOrTATTC TCGGACTGTG AAnTTOCTG 2880 

ATGTAAAACA GAATATTCTG TAAACCTAAT GTCTGTATAA ATAATGAGCG ITAACACACT 2940 

AAAA3ATTCA ATAAGAAGTC AAAAAAAAAA AAAAAAAACT CGAGGGGGGG CXXX3GTACCC 3000 

AATTTOCCAA ATAGAGATOG TATTAC 3026 



(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 712 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

GCAGGCTTTG TGCTCACCTA CAAGCTGGGT GAGCAGGGTG CCAGCAGCCT GmCCTCTT 60

CTCCTGCTQG ACCAOSGCGT TTCTGCTCCC GAGTK3QGAC TCTQGAATGG TGrPOGGTCCT 120

GTGC3TCTGCT CCATCGCTQG CTCCTCCCTG GGrPQGGACCT TC3CTGGOCAA GCACTOGAAA 180

CTGCTOCCTC TGrPPGARGTC OGrTOCTGCGC TTCOGCCTCG GGQGCC CA GC CTOTCAGACT 240 

GCCTTGCTCT TOCACCTOGA CACCCTOGGG GCCflGCATQG ACGCTQQCAC AATCITCAGA 300 

GGGrPCAOCCT TGCTGAGCCT ATGTCTC3CAG CACTTCTTGG GAGGCCTCGT CftCCflCAGTC 360

AOCTTCACTG GGATGATGCG CTQCAGCCAG CTGGCCCCCA GC3GCCTC3CAG GCCACACACT 420 

ACAGCCTPCT CGCCACGCTG GAGCTGCTGG GGAAGCTGCT GCTGQGCACT CTQOQGAGCSC 480 

CTGGCTGATG GGTTGGGGCC ACATCCCTGC TTCTTGCTCC aXXTrCAlCCT CTCTGCCTTT 540 

ccxxsrrcTGT ACCTGGACCT AGCACCCAGC ACCTTTCTCT GAGCTGAGTC GCTOGAGTOG 600 

TCAATAAAGC CACKTGTCCC TGrTGCXXXZAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 660

AACTGGAGGG GGQGCCOQGT ACCCAAATCG CXXXSATATGA TCGTAAACAA TC 712 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1289 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

CAAAATTTCA GAACTTTCAG GAGGGCAAGA GAATATCAAA CAAAGATTTC TQGAAGTATT 60 

TTGCCAACCT TCTGGrTTGAG CTGCAAGAAA ATATTTATOG TGAGAACTTT TCTGTTTCCC 120 

GTTATTGGGT TTrTQGTTGG TTTTTGnTTG TTTTTTACTA TGCTTTGCTC TGTAAAAATA 180 

TGCAACTGAA CTACATTCAG AAGGAAATAT TGTCTACATA GAATATTATA TGAAGTTGGTr 240 

ACATAATTCT GATGAGGAAA AAAAATCTTT GCAATTCTTT AAGCCATATT GTmiTlTlt, ' 300

TGrrGTTGTTT TOCCTOGATG AAAATATCAG TATTAAGTAG ACAGCATATT ATTCAAGTGT 360 

TTAGACTTAT TAATATGTTC TTGTCCTGTA TTTATACATA TGTGTATTTT GGAAAGTATT 420 

GCdTTTTTA AGGGAAGCTA TAATTCGATA CATAGTGAAA AAGGGAATGG TGACOCCTTT 480 

GTGCCTCTTC CACTGAGQAT AACAAACAGC ATTGTAATCC ATTCTCTTGC ACCTTCTTCT 540 

TCTTATCTTG TTATTACGGT TTTATTAATT TTGTAGAGGG ACAGGGAGTG GGCAAGGGGA 600

AGAAGCAGCT TATTTGACTA AOCAGCCCCT CTGTGGTCCA CCAGOGICTT GGCTTQGrPQG 660 

GAGGGCTCTC AATCAGCAGG GCCCCAGGAG GGAAGAAGAA GTGGGGCAAA GCCTQGCCTC 720 

GCOGCTOSGG AGCTTTGCCA TCTGAGCCAC GCCTCCTCCA GGCCATQCTC CTTGAACTITC 780

GAAATGTCAA CXX3GAGCCCT TACACCAGCX: CTCCAGCftTC TAAIMACTT GAATCTACTC 840

TAAAOSAATA TrTAATCCAA CCTCACTACA TrcrTAGCTCA GTCCAACGAC TAACCCTGAA 900

ATQGGGGTGT TCCAGCCTTC AGOGAGATOG CCAAGCGGTC CCCTOGGGGC TGTGGCAGCG 960

GQCTTATCCT TCTCTGTTQC CAACCTTGCC GTCOGACCTC CTCCGCCCCC ATGOGGTGAC 1020

CCCGTCOGTG TCTGTGTCTG TCCATACGIO TGAGTCCAGC TAAAAAGACA AAACAGAACC 1080

CGTOQGOCCA GCTCGGAAGG TGCGTGGAGA AGGCTCCGAC GTCTCOGAAG TGCAGCCCTT 1140

GGGATQQCAT TCXGTTGTGT GCCTTATTCC IXSGAGAArCT GTATACGGCT CGCCTATAGA 1200

AATAiaGCCT CTTCATGCTG TATTAAAAQ6 ACTTTTAAAA GCAAAAAAAA AAAAAAAAAA 1260

CTTGAGGQGG GGNCCGGTAC CCAATTOTC 1289



(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:

Met Phe Leu Ile Phe Val Tyr Phe Leu Lys Ile Leu Phe Ser Ser Ser
1 5 10 15

Leu Pro Phe Leu Trp Leu
20



(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 128 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314:

Met Met Phe Leu Thr Gln Gly Gly Pro Leu Pro Ser Thr Arg Ala Arg
1 5 10 15

Pro Thr Cys Gln Ala Gly Ala Leu Pro Lys Pro Ser Gly Leu Leu Gly
20 25 30






Val Thr Cys Trp Asn Gly Leu Lys Gly Pro Leu Cys Gly Asn Arg Cys 
35 40 45

Ser Pro Asn Thr Leu Leu Leu Ala Ala Arg Gln Ala Leu Trp Lys Gly
50 55 60

Arg Gly Arg Thr His Gln Asp Leu Pro Gly Pro Leu Gln Gly Arg Gln
65 70 75 80



Leu Gly Pro Glu Pro Lys His Leu Ala Leu Leu Pro Pro Arg Gly Gln
85 90 95

Glu Ala Ser Trp Ala Ser Ser Leu Pro Gly Gln Gly Pro Leu Pro Leu
100 105 110

Pro His Ile Asn Cys Thr Val Phe Ser Leu Lys Ala Ser Phe Ile Lys
115 120 125




(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:

Met Gln Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu
1 5 10 15

Cys Asp Val Pro Ile Ser Leu Gly Phe Ser Pro Ser
20 25




(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:

Met Asp Gly I^e Ser Ser Arg Leu Phe Ser Ser Leu Pro ttie Val Ala 
1 5 10 15

Leu Gln Trp Phe Ile Val Ile Ser His Leu Leu Ser Leu Ser Leu Ser
20 25 30

Ala Cys Cys Tyr Gln Thr His Cys Ser Leu Xaa Gln Leu Ser Ser Ala
35 40 45

Phe Ser Xaa Met Gly Glu Ser Cys Val Gly Glu Arg Glu Tyr Xaa Phe
50 55 60






(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:

Met Pro Leu Ile Asn Leu Leu Leu Leu Tyr Tyr Val Pro Asn Gly Gly
1 5 10 15

Lys Gln Asp Lys Lys
20



(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:

Met Gly Arg His Leu Val Leu Val Met Phe Ile Thr Thr Ser Leu His
1 5 10 15

Ser Gly Thr Pro Val Pro Glu Asn Val Ile Cys Gly Val Thr Lys Gly
20 25 30

Pro Gln Gly Lys Lys Lys Lys
35



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

Met Leu Trp Trp Ser Arg Asp Tyr Thr Met Val Phe Leu Leu Phe Tte
1 5 10 15

Met Val Phe Thr Gly Asp Leu Val Ile Arg Gly Arg Thr Glu Leu Ser
20 25 30

Leu 




(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Val Cys Ser Ser Leu Cys Asp Ile Gly Gly Ile Ile 'flu: Pro Phe

Ile Val Phe Arg Leu Arg Glu Val Trp Gln Ala Leu Pro Leu Ile Leu
20 25 30

Phe Ala Val Leu Gly Leu Leu Ala Ala Gly Val Thr Leu Leu Leu Pro
35 40 45

Glu Thr Lys Gly Val Ala Leu Pro Glu Thr Met Lys Asp Ala Glu Asn
50 55 60

Leu Gly Arg Lys Ala Lys Pro Lys Glu Asn Thr Ile Tyr Leu Lys Val
65 70 75 80

Gln Thr Ser Glu Pro Ser Gly Thr
85



(2) INFORMATION FOR SEQ ID NO: 321:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:

Met Gln Pro Gly Ala Gly Val Leu Val Leu Gly Leu Leu Leu Pro Pro

Pro Gln Ser Pro Ser Leu Ser
20



(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

Met Thr Phe Thr Leu Gly Asp Ser Gln Val Leu Leu Ile Asn Leu Phe
1 5 10 15

Pro Ser Met Pro Ser Gly Ser Cys Ala Arg Pro 
20 25

(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:

Met Cys Leu Glu Cys Trp Ala Glu Asn Leu Gly Pro His His "nir Ser
1 5 10 15

Ser Leu Leu Asn Pro Arg His Leu Pro Ser Ile Pro Ala Met Phe Pro
20 25 30

Val Ser Ser Gly Cys Phe Gln Glu Gln Gln Glu Met Asn Lys Ser Leu
35 40 45

Val Ser Cys Leu Kie Val Leu His Phe Val Lai His Cys Ile Phe Xaa
50 55 60

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:

Met Leu Ser Thr Ser Glu Tyr Ser Gln Ser Pro Lys Met Glu Ser Leu
1 5 10 15

Ser Ser His Arg Ile Asp Glu Asp Gly Glu Asn Thr Gln Ile Glu Asp
20 25 30

Thr Glu Pro Met Ser Pro Val Leu Asn Ser Lys Phe Val Pro Ala Glu
35 40 45

Asn Asp Ser Ile Leu Met Asn Pro Ala Gln Asp Gly Glu Val Gln Leu
50 55 60

Ser Gln Asn Asp Asp Lys Thr Lys Gly Asp Asp Thr Asp Thr Arg Asp
65 70 75 80

Asp Ile Ser Ile Leu Ala Thr Gly Cys Lys Gly Arg Glu Glu Thr Val
85 90 95

Ala Glu Glu Val Cys Ile Asp Leu Thr Cys Asp Ser Gly Ser Gln Ala
100 105 110

Val Pro Ser Pro Ala Thr Arg Ser Glu Ala Leu Ser Ser Val Leu Asp
115 120 125

Gln Glu Glu Ala Met Glu Ile Lys Glu His His Pro Glu Glu Gly Ser
130 135 140

Ser Gly Ser Glu Val Glu Glu Ile Pro Glu Thr Pro Cys Glu Ser Gln
145 150 155 160

Gly Glu Glu Leu Lys Glu Glu Asn Met Glu Ser Val Pro Leu His Leu
165 170 175

Ser Leu Thr Glu Thr Gln Ser Gln Gly Leu Cys Leu Arg Arg His Pro
180 185 190

Lys Lys Lys Lys
195

(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 252 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg Lys
1 5 10 15

Arg Leu Leu Glu Gln Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu
20 25 30

Ala Xaa Xaa Gly Ile Gly Leu Met Val Leu His Ala Glu Met Leu Trp
35 40 45

Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr
50 55 60

Leu Trp Leu Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr Gly Asp Val
65 70 75 80

Val Pro Gly Thr Met Trp Gly Lys Ile Val Cys Leu Cys Thr Gly Val
85 90 95

Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys
100 105 110

Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp
115 120 125

Ile Gln Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gln
130 135 140

Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala
145 150 155 160

Ala Arg Xaa His Gln Arg Xaa Leu Leu Ala Ala Ile Asn Ala Phe Arg
165 170 175

Gln Val Arg Leu Lys His Arg Lys Leu Arg Glu Gln Val Asn Ser Met
180 185 190

Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln Asn
195 200 205

Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu Ala
210 215 220

Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro
225 230 235 240

Arg Gln Leu Pro Glu Pro Ser Gln Gln Ser Lys Xaa
245 250



(2) INFORMATION FOR SEQ ID NO: 326:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val
1 5 10 15

Ile Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala
20 25 30

Val Ile Lys Asn Asn Ser His Tyr Gln Thr Ser Lys Ala Leu Glu Leu
35 40 45

Glu Lys Thr Thr Glu Asn Lys Glu Ser Asn Pro Phe Ile Leu Gln Val
50 55 60

Asn Lys Leu Xaa
65



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

Met Gly Glu Gly Lys Asn Gly Phe Gly Gly Phe Val His Thr Ala Asp
1 5 10 15

Ala Cys Trp Glu Gly Val His Ser Glu Pro Val Cys Arg Thr Val His
20 25 30

Thr Val His Thr Cys His His Gln Ala Phe Leu Val Leu Ile Gly Trp
35 40 45

Ser Lys Ser Gly Lys Glu Arg Lys Glu Ala Phe Leu Thr Ala Ile Ile
50 55 60

Leu Asn Ser Arg Ser Ile His Ile Ser Cys Ser Trp Pro Pro Ser Pro
65 70 75 80

Val Pro Gln Xaa



(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:

Met Leu Leu Ile Asn Leu Leu Trp Leu Val Thr Met Ile Lys Ser Val
1 5 10 15

Ile Asn Asn Asn Ile Ile Leu Phe Leu Lys Lys Lys Ser Leu Phe Phe
20 25 30

Ile Asp Ser Val
35

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

Met Thr Phe Pro Phe Glu Lys Lys Ile Val Ala Phe Ser Ala Phe Tyr
1 5 10 15

Leu Ile Pro Gly Glu Ser Arg Leu Ala Pro Thr Phe Asn Pro Ser Ala
20 25 30

Asp Met Thr Val Ile Leu Arg Gly Arg Ala Gln His Lys Thr Ala Met
35 40 45

Leu Glu Ser Tyr Asn Trp Lys Val Ser Cys Gln Leu Arg Glu Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

Met His Ser Lys Gly Ser Ser Leu Leu Leu Phe Leu Pro Gln Leu Ile
1 5 10 15

Leu Ile Leu Pro Val Cys Ala His Leu His Glu Glu Leu Asn Cys Cys
20 25 30

Phe His Arg
35



(2) INFORMATION FOR SEQ ID NO: 331:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331:

Met Gly Ala Leu Val Leu Leu Leu Cys Leu Leu Val Gly Val Gln Gln
1 5 10 15

Ser Gly Ser Val Trp Asp Ser
20



(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332:



Met Gln Ser Ala Glu Ile Leu Ser Trp Thr Asp Val Leu His Asp Phe
1 5 10 15

Leu Phe Ser Leu Phe Leu Trp Pro Ala Phe Glu Asp Arg Ala Leu Leu
20 25 30

Ile Phe Thr Leu Asn Gln Ile Val
35 40



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

Met Gln Ser Leu Val Gln Trp Gly Leu Asp Ser Tyr Asp Tyr Leu Gln
1 5 10 15

Asn Ala Pro Pro Gly Phe Phe Pro Arg Leu Gly Val Ile Gly Phe Ala
20 25 30

Gly Leu Ile Gly Leu Leu Leu Ala Arg Gly Ser Lys Ile Lys Lys Leu
35 40 45

Val Tyr Pro Pro Gly Phe Met Gly Leu Ala Ala Ser Leu Tyr Tyr Pro
50 55 60

Gln Gln Ala Ile Val Phe Ala Gln Val Ser Gly Glu Arg Leu Tyr Asp
65 70 75 80

Trp Gly Leu Arg Gly Tyr Ile Val Ile Glu Asp Leu Trp Lys Glu Asn
85 90 95

Phe Gln Lys Pro Gly Asn Val Lys Asn Ser Pro Gly Thr Lys Xaa
100 105 110



(2) INFORMATION FOR SEQ ID NO: 334:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:

Met Ala Pro Ser Leu Leu Leu Leu Ala Pro Leu Cys Ser Leu Glu Ala
1 5 10 15

Val Leu Ser Ser Pro Leu Glu Lys Gln Cys Gln Leu Pro Gly Ile Phe
20 25 30

Cys Gln Leu Gln Leu Pro Cys Pro Leu Leu Leu Ser Ala Gln Leu Leu
35 40 45

Lys Gly Ile Val Xaa Pro Arg Cys Pro Ala Ser Leu Pro Gln Pro Pro
50 55 60

His Pro Ala Pro Ser Trp His Leu Pro Leu His Cys Thr Glu Arg Xaa
65 70 75 80

Pro His His Leu Pro Leu Gln Gly Gly Ser Ser Asn Met Glu Glu Xaa
85 90 95

Asn Tyr Arg Gly Tyr Xaa Asp Ala Gln Leu
100 105



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:

Met Thr Thr Cys Leu Phe Gly Leu Leu Ser Cys Glu Met Ser Ala Gln
1 5 10 15

Val Ser Gln Lys Ser Cys Val Tyr Asp Glu Ser Glu Cys Phe Ser Ser
20 25 30

Val Gly Gln Leu Leu Ala Leu Leu Ile Leu Val Tyr Val Leu Pro Ser
35 40 45

Ile Xaa
50

(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

Met Leu Trp Lys Cys Ser Gln Asn Ile Ala Arg Cys Leu Leu Leu Leu
1 5 10 15

Leu Ala Leu Val Glu Ile Lys Leu Glu Asp Leu Gln Ser Gln Leu His
20 25 30

Pro Thr Trp Lys Ser Ile Pro Gly Pro Ser Pro Arg Asn Gln His Arg
35 40 45



(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:



Met Leu Ile Pro Leu Gln Cys Leu Phe Ser Ser Asp Arg Met Leu Thr
1 5 10 15

Phe Leu Thr Pro Trp Gln Lys Gly Glu Lys Cys Val Leu Gly Trp Val
20 25 30

Thr Lys Phe Leu Ser Glu Ile Ser Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 338:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:

Met Thr Phe Ser Ser Leu Lys Leu Phe Val Leu Thr Cys Ile Ile Lys
1 5 10 15

Gly Leu Glu Arg Phe Ile Ile Leu Arg Glu Val Cys Asn Gln Glu Ile
20 25 30

Gln Arg Ser Leu Ser Ser Asn Leu Val His Val Leu Leu Gln Pro Ala
35 40 45

Thr Phe Lys Asp Val Leu Val Thr Glu Ile Ile Cys Leu Cys Met Cys
50 55 60

Leu Tyr Ser Ile Lys Tyr Met Pro Pro Gln Lys Lys
65 70 75



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:

Lys Val Tyr Ile Phe Ile Phe Met Val Leu Ile Leu Pro Ser Leu

Gly Leu Thr Arg Tyr Met Pro Pro Xaa Ser Xaa Leu Asn Ser Glu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 340:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:

Met Ala Lys Ile Ser Pro Phe Glu Val Val Lys Arg Thr Ser Val Pro
1 5 10 15

Val Leu Val Gly Leu Val Ile Val Ile Val Ala Thr Glu Leu Met Val
20 25 30

Pro Gly Thr Ala Ala Ala Val Thr Gly Lys
35 40



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:

Met Arg Leu Phe Phe Ile Gly Phe Leu Leu Leu Phe Ser Phe Gly Leu
1 5 10 15

Leu Arg Gln Pro Ser Leu Ser Ala Glu His
20 25



(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:

Met Val Phe Ser Val Ser Ser Ala Leu Ala Leu Leu Leu Met Leu Leu
1 5 10 15

Ser Ser Asp Leu Ala Lys Lys ■But Glu
20 25



(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

Met Ser Leu Glu Phe Tyr Gln Lys Lys Lys Ser Arg Trp Pro Phe Ser
1 5 10 15

Asp Glu Cys Ile Pro Trp Glu Val Trp Thr Val Lys Val His Val Val
20 25 30

Ala Ala Thr Glu Gln Glu Arg Gln Ile Cys Arg Glu Lys Val Gly
35 40 45

Glu Leu Leu Cys Glu Lys Ile Ile Asn Ile Val Glu Val Met Asn Arg
50 55 60

Lys Met Pro Thr Gln Ser Glu Val Asp Asn Val
65 70 75 80

Phe Asp Thr Gly Leu Arg Asp Val Gln Pro Tyr Leu Tyr Lys Ile Ser
85 90 95

Phe Gln Ile Thr Asp Ala Leu Gly Thr Ser Val Thr Thr Thr Met Arg
100 105 110

Arg Leu Ile Lys Asp Thr Leu Pro Ser Glu Arg Arg Trp Ile Ser Gly
115 120 125

Ser Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly Ile Ala Leu Leu
130 135 140

Gly Leu Trp Ala Leu Glu Pro Ala Leu Gly His Trp Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 520 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:



Met Phe Leu Leu Pro Leu Pro Ala Ala Gly Arg Val Val Val Arg Arg
1 5 10 15

Leu Ala Val Arg Arg Phe Gly Ser Arg Ser Leu Ser Thr Ala Asp Met
20 25 30

Thr Lys Gly Leu Val Leu Gly Ile Tyr Ser Lys Glu Lys Glu Asp Asp
35 40 45

Val Pro Gln Phe Thr Ser Ala Gly Glu Asn Phe Asp Lys Leu Leu Ala
50 55 60

Gly Lys Leu Arg Glu Thr Leu Asn Ile Ser Gly Pro Pro Leu Lys Ala
65 70 75 80

Gly Lys Thr Arg Thr Phe Tyr Gly Leu His Gln Asp Phe Pro Ser Val
85 90 95

Val Leu Val Gly Leu Gly Lys Lys Ala Ala Gly Ile Asp Glu Gln Glu
100 105 110

Asn Trp His Glu Gly Lys Glu Asn Ile Arg Ala Ala Val Ala Ala Gly
115 120 125

Cys Arg Gln Ile Gln Asp Leu Glu Leu Ser Ser Val Glu Val Asp Pro
130 135 140

Cys Gly Asp Ala Gln Ala Ala Ala Glu Gly Ala Val Leu Gly Leu Tyr
145 150 155 160

Glu Tyr Asp Asp Leu Lys Gln Lys Lys Lys Met Ala Val Ser Ala Lys
165 170 175

Leu Tyr Gly Ser Gly Asp Gln Glu Ala Trp Gln Lys Gly Val Leu Phe
180 185 190

Ala Ser Gly Gln Asn Leu Ala Arg Gln Leu Met Glu Thr Pro Ala Asn
195 200 205

Glu Met Thr Pro Thr Arg Phe Ala Glu Ile Ile Glu Lys Asn Leu Lys
210 215 220

Ser Ala Ser Ser Lys Thr Glu Val His Ile Arg Pro Lys Ser Trp Ile
225 230 235 240

Glu Glu Gln Ala Met Gly Ser Phe Leu Ser Val Ala Lys Gly Ser Asp
245 250 255

Glu Pro Pro Val Phe Leu Glu Ile His Tyr Lys Gly Ser Pro Asn Ala
260 265 270

Asn Glu Pro Pro Leu Val Phe Val Gly Lys Gly Ile Thr Phe Asp Ser
275 280 285

Gly Gly Ile Ser Ile Lys Ala Ser Ala Asn Met Asp Leu Met Arg Ala
290 295 300

Asp Met Gly Gly Ala Ala Thr Ile Cys Ser Ala Ile Val Ser Ala Ala
305 310 315 320

Lys Leu Asn Leu Pro Ile Asn Ile Ile Gly Leu Ala Pro Leu Cys Glu
325 330 335

Asn Met Pro Ser Gly Lys Ala Asn Lys Pro Gly Asp Val Val Arg Ala
340 345 350

Lys Asn Gly Lys Thr Ile Gln Val Asp Asn Thr Asp Ala Glu Gly Arg
355 360 365

Leu Ile Leu Ala Asp Ala Leu Cys Tyr Ala His Thr Phe Asn Pro Lys
370 375 380

Xaa Ile Leu Asn Ala Ala Thr Leu Thr Gly Ala Met Asp Val Ala Leu
385 390 395 400

Gly Ser Gly Ala Thr Gly Val Phe Thr Asn Ser Ser Trp Leu Trp Asn
405 410 415

Lys Leu Phe Glu Ala Ser Ile Glu Thr Gly Asp Arg Val Trp Arg Met
420 425 430

Pro Leu Phe Glu His Tyr Thr Arg Gln Val Val Asp Cys Gln Leu Ala
435 440 445

Asp Val Asn Asn Ile Gly Lys Tyr Arg Ser Ala Gly Ala Cys Thr Ala
450 455 460

Ala Ala Phe Leu Lys Glu Phe Val Thr His Pro Lys Trp Ala His Leu
465 470 475 480

Asp Ile Ala Gly Val Met Thr Asn Lys Asp Glu Val Pro Tyr Leu Arg
485 490 495

Lys Gly Met Thr Gly Arg Pro Thr Arg Thr Leu Ile Glu Phe Leu Leu
500 505 510

Arg Phe Ser Gln Asp Asn Ala Xaa
515 520



(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:



Thr Ile Leu Phe Leu Phe Leu Gln Leu Ser Ala Leu Arg Leu Ile Val
1 5 10 15

Gly Lys Asp Ser Ile Asp Ile Asp Ile Ser Ser Arg Arg Arg Glu Asp
20 25 30

Gln Ser Leu Arg Leu Asn Ala
35

(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 234 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:

Met Thr Ser Glu Leu Asp Ile Phe Val Gly Asn Thr Thr Leu Ile Asp
1 5 10 15

Glu Asp Val Tyr Arg Leu Trp Leu Asp Gly Tyr Ser Val Thr Asp Ala
20 25 30

Val Ala Leu Arg Val Arg Ser Gly Ile Leu Glu Gln Thr Gly Ala Thr
35 40 45

Ala Ala Val Leu Gln Ser Asp Thr Met Asp His Tyr Arg Thr Phe His
50 55 60

Met Leu Glu Arg Leu Leu His Ala Pro Pro Lys Leu Leu His Gln Leu
65 70 75 80

Ile Phe Gln Ile Pro Pro Ser Arg Gln Ala Leu Leu Ile Glu Arg Tyr
85 90 95

Tyr Ala Phe Asp Glu Ala Phe Val Arg Glu Val Leu Gly Lys Lys Leu
100 105 110

Ser Lys Gly Thr Lys Lys Asp Leu Asp Asp Ile Ser Thr Lys Thr Gly
115 120 125

Ile Thr Leu Lys Ser Cys Arg Arg Gln Phe Asp Asn Phe Lys Arg Val
130 135 140

Phe Lys Val Val Glu Glu Met Arg Gly Ser Leu Val Asp Asn Ile Gln
145 150 155 160

Gln His Phe Leu Leu Ser Asp Arg Leu Ala Arg Asp Tyr Ala Ala Ile
165 170 175

Val Phe Phe Ala Asn Asn Arg Phe Glu Thr Gly Lys Lys Lys Leu Gln
180 185 190

Tyr Leu Ser Phe Gly Asp Phe Ala Phe Cys Ala Glu Leu Met Ile Gln
195 200 205

Asn Trp Thr Leu Gly Pro Val Asp Ser Gln Met Asp Asp Met Asp Met
210 215 220

Asp Leu Asp Arg Asn Phe Ser Arg Thr Xaa
225 230



(2) INFORMATION FOR SEQ ID NO: 347:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 169 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347:

Met Ala Ala Ala Val Ala Gly Met Leu Arg Gly Gly Leu Leu Pro Gln
1 5 10 15

Ala Gly Arg Pro Thr Leu Gln Thr Val Arg Tyr Gly Ser Lys Ala
20 25 30

Val Thr Arg His Arg Arg Val Met His Phe Gln Arg Gln Lys Leu Met
35 40 45

Ala Val Thr Glu Tyr Ile Pro Pro Lys Pro Ala Ile His Pro Ser Cys
50 55 60

Leu Pro Ser Pro Pro Ser Pro Pro Gln Glu Glu Ile Gly Leu Ile Arg
65 70 75 80

Leu Leu Arg Arg Glu Ile Ala Ala Val Phe Gln Asp Asn Arg Met Ile
85 90 95

Ala Val Cys Gln Asn Val Ala Leu Ser Ala Glu Asp Lys Leu Leu Ile
100 105 110

Ala Thr Pro Ala Ala Glu Thr Gln Asp Pro Asp Glu Gly Leu Pro Gln
115 120 125

Pro Gly Pro Glu Ser Pro Ser Trp Arg Ile Pro Ser Thr Lys Ile Cys
130 135 140

Cys Pro Phe Leu Trp Gly Thr Thr Cys Cys Trp Ser Val Lys Ser Pro
145 150 155 160

Arg Ser Arg Arg Trp Tyr Gly Ser Xaa
165



(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

Met Lys Arg Ser Phe Leu Leu Pro Leu Leu Leu Val Gly Phe Leu Asp
1 5 10 15

Thr Ala His Leu Ile Leu Leu Glu Thr Leu Ser Val Cys Leu Trp Leu
20 25 30

Pro Ser Leu Ile Asp Ser Arg Cys Val Met Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 78 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

Met Lys Glu Gly Pro Pro Cys Lys Arg His His Tyr Tyr Gln Asn Cys
1 5 10 15

Gly Ala Lys Leu Leu Val Ser Leu Phe Gly Glu Thr Asn Gln Ile His
20 25 30

Leu Leu Glu Thr Gln Val Gly Thr Glu Lys Gly Gly Glu Arg Ile Trp
35 40 45

Glu Glu Lys Trp Arg Ile Ser Ser Thr Val Leu Phe Ile Ser Val Asn
50 55 60

Ser Tyr Val Glu Gly Ser Val Leu Glu Ile Lys Leu Phe Tyr
65 70 75



(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

Met Ser Glu Ile Leu Ser Leu Leu Phe Cys Leu Leu Gly Pro Ala Leu
1 5 10 15

Asp Glu Arg Arg Glu Glu Lys Asp
20



(2) INFORMATION FOR SEQ ID NO: 351:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 274 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:

Met Ser Ser Ala Gly Thr Ala Thr Pro Leu Glu Met Asp His Lys Leu
1 5 10 15

Thr Ser Gln Pro Gly Arg Pro Ser Phe Tyr Cys Asn Ser Arg His Ser
20 25 30

Ile Val Gly Ser Ser His Gln Leu Gly Phe Trp Phe Ser His Leu Glu
35 40 45

Ser Ser Gly Leu Lys Val Phe Gln Val Ser Leu Pro Cys Glu Cys Val
50 55 60

Asn Leu Pro Thr Arg Ile Ala Ser Val Val Leu Ser Leu Met Ser Leu
65 70 75 80

Leu Val Val Gly Gln Ala Pro Ala Trp Glu Gly Ser Leu Leu Arg Gly
85 90 95

Arg Pro Ala Gly Gly Ala His Leu Cys Ala Met Xaa Val Ile Glu Gly
100 105 110

Leu Val Val Asp Val Gly Glu Arg Ile Leu His Gly Gln Arg Glu Val
115 120 125

Gly Gln Val Ser Gln Val Leu Pro Ala Leu Ser Leu Gly Leu Val Phe
130 135 140

Leu Cys Gln Gly Thr Val Glu Lys Val Ser Gly Ala Ala His Cys Ser
145 150 155 160

Ser Leu Leu Cys Cys Leu Pro Trp Gln Cys Ser Gly Gly Gly Phe Pro
165 170 175

Thr Xaa Arg Cys Ser Arg Pro Tyr Phe Ser Ser His Lys Gly Val Ala
180 185 190

Ala Thr Leu Ala Leu Thr Cys His Cys Asp Lys Val His Val Ala Gly
195 200 205

Leu Gly Lys Asp Trp Ala Ile Glu Gln Arg Arg Arg Thr Cys Glu Ser
210 215 220

Asp Xaa Glu Xaa Xaa Pro Phe Thr Leu Ala Gly Leu Val Leu Val Leu
225 230 235 240

Arg Phe Cys Gln Val Val Leu Val Trp Ile Pro Gln Leu Gly Asp Lys
245 250 255

His Trp Arg Gly Met Thr Arg Leu Gly Arg Val Ser Leu Thr Ser Ser
260 265 270

Ile Xaa



(2) INFORMATION FOR SEQ ID NO: 352: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:

Met Ile Phe Thr Ser Val Thr Lys Gly Ile Leu Leu Ile Ala Leu Trp
1 5 10 15

Val Pro Leu Phe His Phe Met Leu Ile Asp Ser Ile Leu Gly Pro Ser
20 25 30

Arg Leu Leu Thr Asp Gly Val Pro Phe Asn Pro Trp His Val Xaa
35 40 45



(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353:

Met Lys Thr
1

(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

Met Ser Ile Ser Gly Thr Asp Gly Leu Ile Leu Leu Leu Val Gly Leu
1 5 10 15

Glu Ala Xaa Val Arg Ser Ser Lys Lys Trp Ile Pro Lys Ala Leu Xaa
20 25 30

Val Thr Gln Ala Lys Trp Asn Ser Trp Pro Ser Arg Arg Asn Ala Gly
35 40 45

Phe Ala Leu His
50



(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

Met Glu His Cys Leu Tyr His Ser Val His Gly Ile Asn Pro Tyr Ile
1 5 10 15

His Lys Asn Thr His Pro Ser Ile Asn Ile Tyr Met Val Trp Asp Glu
20 25 30

Gln Val Asn Ser Phe Glu Arg Glu Phe Val Pro Phe Phe Phe Leu Ile
35 40 45

Ile Leu Leu Asn Cys Cys Gln Leu Ser Asn Lys Gln Thr Glu Lys Leu
50 55 60

Phe Gly Lys Thr Leu His Thr Pro Phe Leu Ser Ser Ala Leu Lys Tyr
65 70 75 80

Arg Leu Asn Thr His Ile Leu Pro Val Phe Ser Tyr Ser Asp Ser Ile
85 90 95

Leu Thr Cys His Leu Ile Leu Ala Ser Tyr Phe Ser His Val Tyr Leu
100 105 110

Pro Val Thr Cys Ile Cys Tyr Leu Asn Arg Lys Lys Asn Ile Gln Lys
115 120 125

Lys Lys Asn Xaa
130



(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

Met Gly Ser Arg Asp His Leu Phe Lys Val Leu Val Val Gly Asp Ala
1 5 10 15

Ala Val Gly Lys Thr Ser Leu Val Gln Asp Tyr Ser Gln Asp Ser Phe
20 25 30

Ser Lys His Tyr Lys Ser Thr Val Gly Val Asp Phe Ala Leu Lys Val
35 40 45

Leu Gln Trp Ser Asp Tyr Glu Ile Val Arg Leu Gln Leu Trp Asp Ile
50 55 60

Ala Gly Gln Glu Arg Phe Thr Ser Met Thr Arg Leu Tyr Tyr Arg Asp
65 70 75 80

Ala Ser Ala Cys Val Ile Met Phe Asp Val Thr Asn Ala Thr Thr Phe
85 90 95

Ser Asn Ser Gln Arg Trp Lys Gln Asp Leu Asp Ser Lys Leu Thr Leu
100 105 110

Pro Asn Gly Glu Pro Val Pro Cys Leu Leu Leu Ala Asn Lys Cys Asp
115 120 125

Leu Ser Pro Trp Ala Val Ser Arg Asp Gln Ile Asp Arg Phe Ser Lys
130 135 140

Glu Asn Gly Phe Thr Gly Trp Thr Glu Thr Ser Val Lys Glu Asn Lys
145 150 155 160

Asn Ile Asn Glu Ala Met Arg Val Leu Ile Glu Lys Met Met Arg Asn
165 170 175

Ser Thr Glu Asp Ile Met Ser Leu Ser Thr Gln Gly Asp Tyr Ile Asn
180 185 190

Leu Gln Thr Lys Ser Ser Ser Trp Ser Cys Cys Xaa
195 200



(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

Met Ile Ser Leu Ile Phe Gln Leu Glu Glu Glu Lys Leu Val Glu Lys
1 5 10 15

Phe Phe Phe Phe Leu Phe Phe Phe Leu Lys Lys Gly Ser Gln Gly Ser
20 25 30

Asn Leu Lys Ile Val Pro Arg His Met Arg Val Val Leu Arg Gly
35 40 45



(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 73 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:



Met Thr Tyr Val Thr Cys Leu His Val Cys Leu Leu Val Glu Phe Leu
1 5 10 15

Asn Ser Gln Leu Thr Asn His Arg Lys Tyr Tyr Phe Leu Ser Tyr Gly
20 25 30

Phe Trp Phe Thr Gly Leu Arg Gly Phe Ser Glu Tyr Leu Trp Pro Gln
35 40 45

Gln His Thr Ser Phe His Pro Asn Arg Asn Glu Ile Asn Phe Val Ser
50 55 60

Thr Asp Asn Arg Ile Trp Val Thr Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

Met Ser Asp Gln Glu Ala Lys Pro Ser Thr Glu Asp Leu Gly Asp Lys
1 5 10 15

Lys Glu Gly Glu Tyr Ile Lys Leu Lys Val Ile Gly Gln Asp Ser Ser
20 25 30

Glu Ile His Phe Lys Val Lys Met Thr Thr His Leu Lys Lys Leu Lys
35 40 45

Glu Ser Tyr Cys Gln Arg Gln Gly Val Pro Met Asn Ser Leu Arg Phe
50 55 60

Leu Phe Glu Gly Gln Arg Ile Ala Asp Asn His Thr Pro Lys Glu Leu
65 70 75 80

Gly Met Glu Glu Glu Asp Val Ile Glu Val Tyr Gln Glu Gln Thr Gly
85 90 95

Gly His Ser Thr Val Xaa
100



(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:

Met Gly Phe Pro Gln Trp His Leu Gly Asn His Ala Val Glu Pro Val
1 5 10 15

Thr Ser Ile Leu Leu Leu Phe Leu Leu Met Met Leu Gly Val Arg Gly
20 25 30

Leu Leu Leu Val Gly Leu Val Tyr Leu Val Ser His Leu Ser Gln Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO: 361:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 179 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

Met Ser Ala Glu Val Lys Val Thr Gly Gln Asn Gln Glu Gln Phe Leu
1 5 10 15

Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu Ile His
20 25 30

Gln Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu Leu Leu Asp
35 40 45

Met Pro Asn Val Arg Glu Leu Ala Glu Ser Asp Phe Ala Ser Thr Phe
50 55 60

Arg Leu Leu Thr Val Phe Ala Tyr Gly Thr Tyr Ala Asp Tyr Leu Ala
65 70 75 80

Glu Ala Arg Asn Leu Pro Pro Leu Thr Glu Ala Gln Lys Asn Lys Leu
85 90 95

Arg His Leu Ser Val Val Thr Leu Ala Ala Lys Val Lys Cys Ile Pro
100 105 110

Tyr Ala Val Leu Leu Glu Ala Leu Ala Leu Arg Asn Val Arg Gln Leu
115 120 125

Glu Asp Leu Val Ile Glu Ala Val Tyr Ala Asp Val Leu Arg Gly Ser
130 135 140

Leu Asp Gln Arg Asn Gln Arg Leu Glu Val Asp Tyr Ser Ile Gly Arg
145 150 155 160

Asp Ile Gln Arg Gln Asp Leu Ser Ala Ile Ala Arg Thr Leu Xaa Lys
165 170 175

Asn His Xaa



(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:

Met Lys Ser Ser Ser Leu Phe Phe Phe Phe Leu Ala His Phe Ile His
1 5 10 15

Ser His Asp Leu Pro Gly Leu Cys Arg
20 25



(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 224 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

Met Lys Phe Ala Ala Ser Gly Xaa Phe Leu His His Met Ala Gly Leu
1 5 10 15

Ser Ser Ser Lys Leu Ser Met Ser Lys Ala Leu Pro Leu Thr Lys Val
20 25 30

Val Gln Asn Asp Ala Tyr Thr Ala Pro Ala Leu Pro Ser Ser Ile Arg
35 40 45

Thr Lys Ala Leu Thr Asn Met Ser Arg Thr Leu Val Asn Lys Glu Glu
50 55 60

Pro Pro Lys Glu Leu Pro Ala Ala Glu Pro Val Leu Ser Pro Leu Glu
65 70 75 80

Gly Thr Lys Met Thr Val Asn Asn Leu His Pro Arg Val Thr Glu Glu
85 90 95

Asp Ile Val Glu Leu Phe Cys Val Cys Gly Ala Leu Lys Arg Ala Arg
100 105 110

Leu Val His Pro Gly Val Ala Glu Val Val Phe Val Lys Lys Asp Asp
115 120 125

Ala Ile Thr Ala Tyr Lys Lys Tyr Asn Asn Arg Cys Leu Asp Gly Gln
130 135 140

Pro Met Lys Cys Asn Leu His Met Asn Gly Asn Val Ile Thr Ser Asp
145 150 155 160

Gln Pro Ile Leu Leu Arg Leu Ser Asp Ser Pro Ser Met Lys Lys Glu
165 170 175

Ser Glu Leu Pro Arg Arg Val Asn Ser Ala Ser Ser Ser Asn Pro Pro
180 185 190

Ala Glu Val Asp Pro Asp Thr Ile Leu Lys Ala Leu Phe Lys Ser Ser
195 200 205

Gly Ala Ser Xaa Thr Thr Gln Pro Thr Glu Phe Lys Ile Lys Leu Xaa
210 215 220



(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 349 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

Met Ser Lys Asn Cys Ile Lys Leu Leu Cys Glu Asp Pro Val Phe Ala
1 5 10 15

Glu Tyr Ile Lys Cys Ile Leu Met Asp Glu Arg Thr Phe Leu Asn Asn
20 25 30

Asn Ile Val Tyr Thr Phe Met Thr His Phe Leu Leu Lys Val Gln Ser
35 40 45

Gln Val Phe Ser Glu Ala Asn Cys Ala Asn Leu Ile Ser Thr Leu Ile
50 55 60

Thr Asn Leu Ile Ser Gln Tyr Gln Asn Leu Gln Ser Asp Phe Ser Asn
65 70 75 80

Arg Val Glu Ile Ser Lys Ala Ser Ala Ser Leu Asn Gly Asp Leu Arg
85 90 95

Ala Leu Ala Leu Leu Leu Ser Val His Thr Pro Lys Gln Leu Asn Pro
100 105 110

Ala Leu Ile Pro Thr Leu Gln Glu Leu Leu Ser Lys Cys Arg Thr Cys
115 120 125

Leu Gln Gln Arg Asn Ser Leu Gln Glu Gln Glu Ala Lys Glu Arg Lys
130 135 140

Thr Lys Asp Asp Glu Gly Ala Thr Pro Ile Lys Arg Arg Arg Val Ser
145 150 155 160

Ser Asp Glu Glu His Thr Val Asp Ser Cys Ile Ser Asp Met Lys Thr
165 170 175

Glu Thr Arg Glu Val Leu Thr Pro Thr Ser Thr Ser Asp Asn Glu Thr
180 185 190

Arg Asp Ser Ser Ile Ile Asp Pro Gly Thr Glu Gln Asp Leu Pro Ser
195 200 205

Pro Glu Asn Ser Ser Val Lys Glu Tyr Arg Met Glu Val Pro Ser Ser
210 215 220

Phe Ser Glu Asp Met Ser Asn Ile Arg Ser Gln His Ala Glu Glu Gln
225 230 235 240

Ser Asn Asn Gly Arg Tyr Asp Asp Cys Lys Glu Phe Lys Asp Leu His
245 250 255

Cys Ser Lys Asp Ser Thr Leu Ala Glu Glu Glu Ser Glu Phe Pro Ser
260 265 270

Thr Ser Ile Ser Ala Val Leu Ser Asp Leu Ala Asp Leu Arg Ser Cys
275 280 285

Asp Gly Gln Ala Leu Pro Ser Gln Asp Pro Glu Val Ala Leu Ser Leu
290 295 300

Ser Cys Gly His Ser Arg Gly Leu Phe Ser His Met Gln Gln His Asp
305 310 315 320

Ile Leu Asp Thr Leu Cys Arg Thr Ile Glu Ser Thr Ile His Val Val
325 330 335

Thr Arg Ile Ser Gly Lys Gly Asn Gln Ala Ala Ser Xaa
340 345



(2) INFORMATION FOR SEQ ID NO: 365:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 467 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

Met Leu His Gln Asp His Ile Thr Phe Ala Met Leu Leu Ala Arg Ile
1 5 10 15

Lys Leu Lys Gly Thr Val Gly Glu Pro Thr Tyr Asp Ala Glu Phe Gln
20 25 30

His Phe Leu Arg Gly Asn Glu Ile Val Leu Ser Ala Gly Ser Thr Pro
35 40 45

Arg Ile Gln Gly Leu Thr Val Glu Gln Ala Glu Ala Val Val Arg Leu
50 55 60

Ser Cys Leu Pro Ala Phe Lys Asp Leu Ile Ala Lys Val Gln Ala Asp
65 70 75 80

Glu Gln Phe Gly Ile Trp Leu Asp Ser Ser Ser Pro Glu Gln Thr Val
85 90 95

Pro Tyr Leu Trp Ser Glu Glu Thr Pro Ala Thr Pro Ile Gly Gln Ala
100 105 110



Ile His Arg Leu Leu Leu Ile Gln Ala Phe Arg Pro Asp Arg Leu Leu
115 120 125

Ala Met Ala His Met Phe Val Ser Thr Asn Leu Gly Glu Ser Phe Met
130 135 140

Ser Ile Met Glu Gln Pro Leu Asp Leu Thr His Ile Val Xaa Thr Glu
145 150 155 160

Val Lys Pro Asn Thr Pro Val Leu Met Cys Ser Val Pro Gly Tyr Asp
165 170 175

Ala Ser Gly His Val Glu Asp Leu Ala Ala Glu Gln Asn Thr Gln Ile
180 185 190

Thr Ser Ile Ala Ile Gly Ser Ala Glu Gly Phe Asn Gln Ala Asp Lys
195 200 205

Ala Ile Asn Thr Ala Val Lys Ser Gly Arg Trp Val Met Leu Lys Asn
210 215 220

Val His Leu Ala Pro Gly Trp Leu Met Gln Leu Glu Lys Lys Leu His
225 230 235 240

Ser Leu Gln Pro His Ala Cys Phe Arg Leu Phe Leu Thr Met Glu Ile
245 250 255

Asn Pro Lys Val Pro Val Asn Leu Leu Arg Ala Gly Arg Ile Phe Val
260 265 270

Phe Glu Pro Pro Pro Gly Xaa Lys Ala Asn Met Leu Arg Thr Phe Ser
275 280 285

Ser Ile Pro Val Ser Arg Ile Cys Lys Ser Pro Asn Glu Arg Ala Arg
290 295 300

Leu Tyr Phe Leu Leu Ala Trp Phe His Ala Ile Ile Gln Glu Arg Leu
305 310 315 320



Arg Tyr Ala Pro Leu Gly Trp Ser Lys Lys Tyr Glu Phe Gly Glu Ser
325 330 335

Asp Leu Arg Ser Xaa Cys Asp Thr Val Asp Thr Trp Leu Asp Asp Thr
340 345 350

Ala Lys Gly Arg Gln Asn Ile Ser Pro Asp Lys Ile Pro Trp Ser Ala
355 360 365

Leu Lys Thr Leu Met Ala Gln Ser Ile Tyr Gly Gly Arg Val Asp Asn
370 375 380

Glu Phe Asp Gln Arg Leu Leu Asn Thr Phe Leu Glu Arg Leu Phe Thr
385 390 395 400

Thr Arg Ser Phe Asp Ser Glu Phe Lys Leu Ala Cys Lys Val Asp Gly
405 410 415

His Lys Asp Ile Gln Met Pro Asp Gly Met Gln Ala Arg Gly Val Cys
420 425 430

Ala Val Gly Gly Val Ala Pro Arg His Pro Asp Ala Leu Leu Ala Gly
435 440 445

Pro Ala Gln Gln Arg Arg Glu Ser Pro Pro Tyr His Thr Gly Cys Gly
450 455 460

His Asp Gln
465



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

Met Ala Asp Glu Ala Thr Arg Arg Val Val Ser Glu Ile Pro Val Leu
1 5 10 15

Lys Thr Asn Ala Gly Pro Arg Asp Arg Glu Leu Trp Val Gln Arg Leu
20 25 30

Lys Glu Glu Tyr Gln Ser Leu Ile Arg Tyr Val Glu Asn Asn Lys Asn
35 40 45

Ala Asp Asn Asp Trp Phe Arg Leu Glu Ser Asn Lys Glu Gly Thr Arg
50 55 60

Trp Phe Gly Lys Cys Trp Tyr Ile His Asp Leu Leu Lys Tyr Glu Phe
65 70 75 80

Asp Ile Glu Phe Asp Ile Pro Ile Thr Tyr Pro Thr Thr Ala Pro Glu
85 90 95

Ile Ala Val Pro Glu Leu Asp Gly Lys Tftac Ala Lys Met Tyr Arg Gly
100 105 110

Gly Lys Ile Cys Leu Thr Asp His Phe Lys Pro Leu Trp Gly Gln Glu
115 120 125

Cys Ala Gln Ile Trp Thr Ser Ser Ser His Gly Ser Gly Ala Gly Ser
130 135 140

Met Xaa Gly Ser Gly Asn Pro Xaa
145 150



(2) INFORMATION FOR SEQ ID NO: 367:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 373 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:

Met Tyr Asp Gly Thr Lys Glu Val Pro Met Asn Pro Val Lys Ile Tyr
1 5 10 15

Gln Val Cys Asp Ile Pro Gln Pro Gln Gly Ser Ile Ile Asn Pro Gly
20 25 30

Ser Thr Gly Ser Ala Pro Trp Asp Glu Lys Asp Asn Asp Val Asp Glu
35 40 45

Glu Asp Glu Glu Asp Glu Leu Asp Gln Ser Gln His His Val Pro Ile
50 55 60

Gln Asp Thr Phe Pro Phe Leu Asn Ile Asn Gly Ser Pro Met Ala Pro
65 70 75 80

Ala Ser Val Gly Asn Cys Ser Val Gly Asn Cys Ser Pro Glu Ala Val
85 90 95

Trp Pro Lys Thr Glu Pro Leu Glu Met Glu Val Pro Gln Ala Pro Ile
100 105 110



Gln Pro Phe Tyr Ser Ser Pro Glu Leu Trp Ile Ser Ser Leu Pro Met
115 120 125

Thr Asp Leu Asp Ile Lys Phe Gln Tyr Arg Gly Lys Glu Tyr Gly Gln
130 135 140

Thr Met Thr Val Ser Asn Pro Gln Gly Cys Arg Leu Phe Tyr Gly Asp
145 150 155 160

Leu Gly Pro Met Pro Asp Gln Glu Glu Leu Phe Gly Pro Val Xaa Leu
165 170 175

Glu Gln Val Lys Phe Pro Gly Pro Glu His Ile Thr Asn Glu Lys Gln
180 185 190

Lys Leu Phe Thr Ser Lys Leu Leu Asp Val Met Asp Arg Gly Leu Ile
195 200 205

Leu Glu Val Ser Gly His Ala Ile Tyr Ala Ile Arg Leu Cys Gln Cys
210 215 220

Lys Val Tyr Trp Ser Gly Pro Cys Ala Pro Ser Leu Val Ala Pro Asn
225 230 235 240



Leu Ile Glu Arg Gln Lys Lys Val Lys Leu Phe Cys Leu Glu Thr Phe
245 250 255

Leu Ser Asp Leu Ile Ala His Gln Lys Gly Gln Ile Glu Lys Gln Pro
260 265 270

Pro Phe Glu Ile Tyr Leu Cys Phe Gly Glu Glu Trp Pro Asp Gly Lys
275 280 285

Pro Leu Glu Arg Lys Leu Ile Leu Val Gln Val Ile Pro Val Val Ala
290 295 300

Arg Met Ile Tyr Glu Met Phe Ser Gly Asp Phe Thr Arg Ser Phe Asp
305 310 315 320

Ser Gly Ser Val Arg Leu Gln Ile Ser Thr Pro Asp Ile Lys Asp Asn
325 330 335

Ile Val Ala Gln Leu Lys Gln Leu Tyr Arg Ile Leu Gln Thr Gln Glu
340 345 350

Ser Trp Gln Pro Met Gln Pro Thr Pro Ser Met Gln Leu Pro Pro Ala
355 360 365

Leu Pro Pro Gln Xaa
370

(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

Met Gly Ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser
1 5 10 15

Leu Gly Gly Arg Cys Ile Gln Gly Arg Phe Ala Ser His Ser Lys Phe
20 25 30

Trp Gly Phe Gly Arg Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr
35 40 45

Pro Pro Asp Gln Glu Pro Gln Lys Glu Thr Glu Pro Ala Thr Ser Ser
50 55 60

His Ala Arg Pro Trp Ala Arg Val Ile Gly Leu Arg Ile Trp Pro Gln
65 70 75 80

Pro Asn Xaa



(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

Met tei Leu Ser Val Ala Ile Phe Ile Leu Leu Thr Leu Val Tyr Ala
1 5 10 15

Tyr Trp Thr Met Xaa
20



(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 227 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

Met Gly Ala Ser Ala Arg Leu Leu Arg Ala Val Ile Met Gly Ala Pro
1 5 10 15

Gly Ser Gly Lys Gly Thr Val Ser Ser Arg Ile Thr Thr His Phe Glu
20 25 30

Leu Lys His Leu Ser Ser Gly Asp Leu Leu Arg Asp Asn Met Leu Arg
35 40 45

Gly Ito Glu Ile Gly Val Leu Ala Lys Ala Phe Ile Asp Gln Gly Lys
50 55 60

Leu Ile Pro Asp Asp Val Met Ita Arg Leu Ala Leu His Glu Leu Lys
65 70 75 80

Asn Leu Thr Gln Tyr Ser Trp Leu Leu Asp Gly Phe Pro Arg Thr Leu
85 90 95

Pro Gln Ala Glu Ala Leu Asp Arg Ala Tyr Gln Ile Asp Thr Val Ile
100 105 110



Asn Leu Asn Val Pro Phe Glu Val Ile Lys Gln Arg Leu «u: Ala Arg
115 120 125

Trp Ile His Pro Ala Ser Gly Arg Val Tyr Asn Ile Glu Phe Asn Pro
130 135 140

Pro Lys Thr Val Gly Ile Asp Asp Leu Thr Gly Glu Pro Leu Ile Gln
145 150 155 160

Arg Glu Asp Asp Lys Pro Glu Thr Val Ile Lys Arg Leu Lys Ala Tyr
165 170 175

Glu Asp Gln Thr Lys Pro Val Leu Glu Tyr Tyr Gln Lys Lys Gly Val
180 185 190

Leu Glu Thr Phe Ser Gly Thr Glu Thr Asn Lys Ile Trp Pro IVr Val
195 200 205

Tyr Ala Ete Leu Gln Ite Lys Val Pro Gln Arg Ser Gln Lys Ala Ser
210 215 220

Val Thr Pro
225



(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

Met Phe Leu Asn Cys Glu Ile Leu Glu Tyr Cys Tyr Tyr Leu Thr Gln
1 5 10 15

Leu Lys Ile Ser Met Gly Lys Tyr Leu Ser Ile Pro Thr Val Leu Leu
20 25 30

Lys Ile Ile Arg Cys Ser Ile ito Ala Val Ser Asp Ser Ser Thr Ser
35 40 45

Trp Ala Ile Lys Ala Gln Leu Lys Ile Glu Asn Lys Asp Leu Asp Asn
50 55 60

Lys Thr Ala Lys Gly Gly Gly Gln Glu Ala Leu Thr Cys Thr Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 372:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

Met Arg Ala Val Phe Pro Cys Cys Pro Phe Leu Thr Leu Met Leu Pro
1 5 10 15

Leu Leu Glu Cys Leu Val Gly Met Ile Met Cys Tyr Leu Gly Ile Ser
20 25 30

Phe Thr Asp Thr Arg Lys Thr Ala Gly Leu Lys Lys Lys Lys Lys Lys
35 40 45

Lys Xaa Xaa
50



(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

Met Phe Met Arg Met His Cys Phe Cys Lys Tyr Cys Cys Ser
1 5 10 15

Phe Ile Val Thr Pro Thr Ser Thr Ser Asn Thr Ala Ser IVr Leu Trp
20 25 30

Pro Trp Ile Ser Ala Ser Met Ala Gly Arg Gly Ser Ser Trp Ala Cys
35 40 45

Thr Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 374:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe
1 5 10 15

Thr Leu Ser Ile Asn Gln Glu Lys Leu Ala Asn His Leu Ala Phe Arg
20 25 30

Ile Leu Phe Phe Ile Val Phe Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 375:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:

Met Cys Ser Gly Gln Ser Gln Val Trp Lys Met Ala Leu Gln Ala Leu
1 5 10 15

Asp Ser Glu Thr Val Val Ile Leu Pro Asp Met His Leu Ile Leu Ser
20 25 30

Leu Arg Leu Ile His Asn Ala Arg Pro Cys Leu Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 376:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 203 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:

Met Leu Ile Ser Glu Glu Glu Ile Pro Phe Lys Asp Asp Pro Arg Asp
1 5 10 15



Glu Thr Tyr Lys Pro His Leu Glu Arg Glu Thr Pro Lys Pro Arg Arg
20 25 30

Lys Ser Gly Lys Val Lys Glu Glu Lys Glu Lys Lys Glu Ile Lys Val
35 40 45

Glu Val Glu Val Glu Val Lys Glu Glu Glu Asn Glu Ile Arg Glu Asp
50 55 60

Glu Glu Pro Pro Arg Lys Arg Gly Arg Arg Arg Lys Asp Asp Lys Ser
65 70 75 80

Pro Arg Leu Pro Lys Arg Arg Lys Lys Pro Pro Ile Gln Tyr Val Arg
85 90 95

Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His Pro Arg Tyr Leu
100 105 110

Gln His His Ile Lys Tyr Gln His Leu Leu Lys Lys Lys Tyr Val Cys
115 120 125

Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gln Lys Gln Leu Leu
130 135 140

Arg His Ala Lys His His Thr Asp Gln Arg Asp Tyr Ile Cys Glu Tyr
145 150 155 160

Cys Ala Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met
165 170 175

Ile His Thr Gly Glu Lys His Tyr Asn Val Arg Ser Val Asp Leu Leu
180 185 190

Val Asp Lys Arg His Leu Leu Ile Gly Thr Xaa
195 200



(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:

Met Leu Pro Arg Arg Thr Phe Tyr Phe Tyr Phe Ile Phe Ile Phe Phe
1 5 10 15

Leu Ala Ser Phe Trp Gly Phe Thr Leu Arg Ala Ser Phe
20 25



(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 136 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378:

Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu Met
1 5 10 15

Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gln His Ala Val Val Leu
20 25 30

Tyr Leu His Asn His Ser Tyr Ala Ser Ile Gln Ser Asp Asp Leu Trp
35 40 45

Asp Ser Phe Asn Glu Val Thr Asn Gln Tte Leu Asp Val Lys Arg Met
50 55 60

Met Lys Thr Trp Thr Leu Gln Lys Gly Phe Pro Leu Val Thr Val Gln
65 70 75 80

Lys Lys Gly Lys Glu Leu Phe Ile Gln Gln Glu Arg Phe Phe Leu Asn
85 90 95

Met Lys Pro Glu Ile Gln Pro Ser Asp Thr Arg Tyr Met Pro Ser Phe
100 105 110

Phe Ser Cys His Leu Phe Cys Thr Leu Arg Trp Lys Tyr Phe Glu Val
115 120 125

Phe Tyr Asn His Lys Phe Leu Xaa
130 135



(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:

Met Ala Trp Arg Arg Arg Glu Pro Ala Ser Gly Leu Ala Ala Cys Trp
1 5 10 15

Leu Trp Arg Cys Ser Pro Trp Pro Cys Ala Cys Pro Gly Pro Gly Ala
20 25 30

Gly Leu Ser Ser Gly Ser Arg Pro Trp
35 40



(2) INFORMATION FOR SEQ ID NO: 380: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 468 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

Met Glu Phe Leu Lys Val Ala Arg Arg Asn Lys Arg Glu Gln Leu Glu
1 5 10 15

Gln Ile Gln Lys Glu Leu Ser Val Leu Glu Glu Asp Ile Lys Arg Val
20 25 30

Glu Glu Met Ser Gly Leu Tyr Ser Pro Val Ser Glu Asp Ser Thr Val
35 40 45

Pro Gln Phe Glu Ala Pro Ser Pro Ser His Ser Ser Ile Ile Asp Ser
50 55 60



Thr Glu Tyr Ser Gln Pro Pro Gly Phe Ser Gly Ser Ser Gln Thr Lys
65 70 75 80

Lys Gln Pro Trp Tyr Asn Ser Thr Leu Ala Ser Arg Arg Lys Arg Leu
85 90 95

Thr Ala His Phe Glu Asp Leu Glu Gln Cys Tyr Phe Ser Thr Arg Met
100 105 110

Ser Arg Ile Ser Asp Asp Ser Arg Thr Ala Ser Gln Leu Asp Glu Phe
115 120 125

Gln Glu Cys Leu Ser Lys Phe Thr Arg Tyr Asn Ser Val Arg Pro Leu
130 135 140

Ala Thr Leu Ser Tyr Ala Ser Asp Leu Tyr Asn Gly Ser Ser Ile Val
145 150 155 160

Ser Ser Ile Glu Phe Asp Arg Asp Cys Asp Tyr Phe Ala Ile Ala Gly
165 170 175

Val Thr Lys Lys Ile Lys Val Tyr Glu Tyr Asp Thr Val Ile Gln Asp
180 185 190

Ala Val Asp Ile His Tyr Pro Glu Asn Glu Met Thr Cys Asn Ser Lys
195 200 205

Ile Ser Cys Ile Ser Trp Ser Ser Tyr His Lys Asn Leu Leu Ala Ser
210 215 220

Ser Asp Tyr Glu Gly Thr Val Ile Leu Trp Asp Gly Phe Thr Gly Gln
225 230 235 240

Arg Ser Lys Val Tyr Gln Glu His Glu Lys Arg Cys Trp Ser Val Asp
245 250 255

Phe Asn Leu Met Asp Pro Lys Leu Leu Ala Ser Gly Ser Asp Asp Ala
260 265 270

Lys Val Lys Leu Trp Ser Thr Asn Leu Asp Asn Ser Val Ala Ser Ile
275 280 285



Glu Ala Lys Ala Asn Val Cys Cys Val Lys Phe Ser Pro Ser Ser Arg
290 295 300

Tyr His Leu Ala Phe Gly Cys Ala Asp His Cys Val His Tyr Tyr Asp
305 310 315 320

Leu Arg Asn Thr Lys Gln Pro Ile Met Val Phe Lys Gly His Arg Lys
325 330 335

Ala Val Ser Tyr Ala Lys Phe Val Ser Gly Glu Glu Ile Val Ser Ala
340 345 350

Ser Thr Asp Ser Gln Leu Lys Leu Trp Asn Val Gly Lys Pro Tyr Cys
355 360 365

Leu Arg Ser Phe Lys Gly His Ile Asn Glu Lys Asn Phe Val Gly Leu
370 375 380

Ala Ser Asn Gly Asp Tyr Ile Ala Cys Gly Ser Glu Asn Asn Ser Leu
385 390 395 400

Tyr Leu Tyr Tyr Lys Gly Leu Ser Lys Thr Leu Leu Thr Phe Lys Phe
405 410 415

Asp Thr Val Lys Ser Val Leu Asp Lys Asp Arg Lys Glu Asp Asp Thr
420 425 430

Asn Glu Phe Val Ser Ala Val Cys Trp Arg Ala Leu Pro Asp Gly Glu
435 440 445

Ser Asn Val Leu Ile Ala Ala Asn Ser Gln Gly Thr Ile Lys Val Leu
450 455 460

Glu Leu Val Xaa
465



(2) INFORMATION FOR SEQ ID NO: 381:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:

Met Arg Lys Glu Asp Gly Phe Trp Phe Phe Phe Phe Leu Phe Phe Phe
1 5 10 15

Val Val Gly Ser Lys Phe Val Asn Gly Asn Lys Leu Val
20 25



(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

Met Pro Leu Ala Pro Tyr Cys Asp Leu Leu Val Ala Leu Ser Phe Ala
1 5 10 15

Leu Val Leu Glu Ser Pro Val Asp Ser Ser Asp Phe Thr
20 25



(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

Met Asn Ser Leu Val Ser Trp Gln Leu Leu Leu Phe Leu Cys Ala Thr
1 5 10 15

His Phe Gly Glu Pro Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg
20 25 30

Pro Thr Gly Gln Gln Leu Glu Ser Leu Gly Leu Leu Ala Pro Gly Glu
35 40 45

Gln Ser Leu Pro Cys Thr Glu Arg Lys Pro Ala Ala Thr Ala Arg Leu
50 55 60

Ser Arg Arg Gly Thr Ser Leu Ser Pro Pro Pro Glu Ser Ser Gly Ser
65 70 75 80

Pro Gln Gln Pro Gly Leu Ser Ala Pro His Ser Arg Gln Ile Pro Ala
85 90 95

Pro Gln Gly Ala Val Leu Val Gln Arg Glu Lys Asp Leu Pro Asn Tyr
100 105 110

Asn Trp Asn Ser Phe Gly Leu Arg Phe Gly Lys Arg Glu Ala Ala Pro
115 120 125

Gly Asn His Gly Arg Ser Ala Gly Arg Gly
130 135



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 74 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

Met Ser Cys Phe Ile Asp Ser Xaa Asp Ser Lys Ile Leu His Leu Leu
1 5 10 15

Val Val Ser Phe Ile Cys Xaa Leu Phe Leu Leu Ile Leu Thr His Gly
20 25 30

Ile Leu Ile Leu Arg Xaa Phe Phe Ser Val Xaa Xaa His Ser Leu Lys
35 40 45

Asn Asn Leu Glu Glu Tyr Leu Ile Leu Met Asn Lys Ala Leu Leu Thr
50 55 60

Arg Glu Asp Phe Phe Val Leu Pro Xaa Ala
65 70

(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 521 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:

Met Ser Ala Gly Glu Val Glu Arg Leu Val Ser Glu Leu Ser Gly Gly
1 5 10 15

Thr Gly Gly Asp Glu Glu Glu Glu Trp Leu Tyr Gly Asp Glu Asn Glu
20 25 30

Val Glu Arg Pro Glu Glu Glu Asn Ala Ser Ala Asn Pro Pro Ser Gly
35 40 45

Ile Glu Asp Glu Thr Ala Glu Asn Gly Val Pro Lys Pro Lys Val Thr
50 55 60

Glu Thr Glu Asp Asp Ser Asp Ser Asp Ser Asp Asp Asp Glu Asp Asp
65 70 75 80

Val His Val Thr Ile Gly Asp Ile Lys Thr Gly Ala Pro Gln Tyr Gly
85 90 95

Ser Tyr Gly Thr Ala Pro Val Asn Leu Asn Ile Lys Thr Gly Gly Arg
100 105 110

Val Tyr Gly Thr Thr Gly Thr Lys Val Lys Gly Val Asp Leu Asp Ala
115 120 125

Pro Gly Ser Ile Asn Gly Val Pro Leu Leu Glu Val Asp Leu Asp Ser
130 135 140

Phe Glu Asp Lys Pro Trp Arg Lys Pro Gly Ala Asp Leu Ser Asp Tyr
145 150 155 160

Phe Asn Tyr Gly Phe Asn Glu Asp Thr Trp Lys Ala Tyr Cys Glu Lys
165 170 175

Gln Lys Arg Ile Arg Met Gly Leu Glu Val Ile Pro Val Thr Ser Thr
180 185 190

Thr Asn Lys Ile Thr Val Gln Gln Gly Arg Thr Gly Asn Ser Glu Lys
195 200 205

Glu Thr Ala Leu Pro Ser Thr Lys Ala Glu Phe Thr Ser Pro Pro Ser
210 215 220

Leu Phe Lys Thr Gly Leu Pro Pro Ser Arg Arg Leu Pro Gly Ala Ile
225 230 235 240

Asp Val Ile Gly Gln Thr Ile Thr Ile Ser Arg Val Glu Gly Arg Arg
245 250 255

Arg Ala Asn Glu Asn Ser Asn Ile Gln Val Leu Ser Glu Arg Ser Ala
260 265 270

Thr Glu Val Asp Asn Asn Phe Ser Lys Pro Pro Pro Phe Phe Pro Pro
275 280 285

Gly Ala Pro Pro Thr His Leu Pro Pro Pro Pro Phe Leu Pro Pro Pro
290 295 300

Pro Thr Val Ser Thr Ala Pro Pro Leu Ile Pro Pro Pro Gly Phe Pro
305 310 315 320

Pro Pro Pro Gly Ala Pro Pro Pro Ser Leu Ile Pro Thr Ile Glu Ser
325 330 335



Gly His Ser Ser Gly Tyr Asp Ser Arg Ser Ala Arg Ala Phe Pro Tyr
340 345 350

Gly Asn Val Ala Phe Pro His Leu Pro Gly Ser Ala Pro Ser Trp Pro
355 360 365

Ser Leu Val Asp Thr Ser Lys Gln Trp Asp Tyr Tyr Ala Arg Arg Glu
370 375 380

Lys Asp Arg Asp Arg Glu Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp
385 390 395 400

Arg Asp Arg Glu Arg Glu Arg Thr Arg Glu Arg Glu Arg Glu Arg Asp
405 410 415

His Ser Pro Thr Pro Ser Val Phe Asn Ser Asp Glu Glu Arg Tyr Arg
420 425 430

Tyr Arg Glu Tyr Ala Glu Arg Gly Tyr Glu Arg His Arg Ala Ser Arg
435 440 445

Glu Lys Glu Glu Arg His Arg Glu Arg Arg His Arg Glu Lys Glu Glu
450 455 460

Thr Arg His Lys Ser Ser Arg Ser Asn Ser Arg Arg Arg His Glu Ser
465 470 475 480

Glu Glu Gly Asp Ser His Arg Arg His Lys His Lys Lys Ser Lys Arg
485 490 495

Ser Lys Glu Gly Lys Glu Ala Gly Ser Glu Pro Ala Pro Glu Gln Glu
500 505 510

Ser Thr Glu Ala Thr Pro Ala Glu Xaa
515 520



PCT/US98/04493






(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 137 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

Met Asn Ser Arg Gly Ile Trp Leu Ala Tyr Ile Ile Leu Val Gly Leu
1 5 10 15

Leu His Met Val Leu Leu Ser Ile Pro Phe Phe Ser Ile Pro Val Val
20 25 30

Trp Thr Leu Thr Asn Val Ile His Asn Leu Ala Thr Tyr Val Phe Leu
35 40 45

His Thr Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala
50 55 60

Arg Leu Leu Thr His Trp Glu Gln Met Asp Tyr Gly Leu Gln Phe Thr
65 70 75 80

Ser Ser Arg Lys Phe Leu Ser Ile Ser Pro Ile Val Leu Tyr Leu Leu
85 90 95

Ala Ser Phe Tyr Thr Lys Tyr Asp Ala Ala His Phe Leu Ile Asn Thr
100 105 110

Ala Ser Leu Leu Ser Val Leu Leu Pro Lys Leu Pro Gln Phe His Gly
115 120 125

Val Arg Val Phe Gly Ile Asn Lys Tyr
130 135



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 186 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:



Met Ala Ala Gln Lys Asp Gln Gln Lys Asp Ala Glu Ala Glu Gly Leu
1 5 10 15

Ser Gly Thr Thr Leu Leu Pro Lys Leu Ile Pro Ser Gly Ala Gly Arg
20 25 30

Glu Trp Leu Glu Arg Arg Arg Ala Thr Ile Arg Pro Trp Ser Thr Phe
35 40 45

Val Asp Gln Gln Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys
50 55 60

Gln Arg Leu Val Arg Asn Val Glu Tyr Tyr Gln Ser Asn Tyr Val Phe
65 70 75 80

Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val 'Our Ser Pro Met Leu
85 90 95

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr Ile Leu Tyr Leu
100 105 110



Arg Thr Leu Glu Ser Lys Leu Val Leu Phe Gly Arg Glu Val Ser Pro
115 120 125

Ala His Gln Tyr Ala Leu Ala Gly Gly Ile Ser Phe Pro Phe Phe Trp
130 135 140

Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr Leu
145 150 155 160

Val Val Ile Gly Ser His Ala Ala Phe His Gln Ile Glu Ala Val Asp
165 170 175

Gly Glu Glu Leu Gln Met Glu Pro Val Xaa
180 185



(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met 
1 



(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 299 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

Met Leu Ser Ile Phe Tyr Phe Ala Ile Pro Val Gly Ser Gly Leu Gly
1 5 10 15

Tyr Ile Ala Gly Ser Lys Val Lys Asp Met Ala Gly Asp Trp His Trp
20 25 30

Ala Leu Arg Val Thr Pro Gly Leu Gly Val Val Ala Val Leu Leu Leu
35 40 45

Phe Leu Val Val Arg Glu Pro Pro Arg Gly Ala Val Glu Arg His Ser
50 55 60

Asp Leu Pro Pro Leu Asn Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala
65 70 75 80

Leu Ala Arg Asn Pro Ser Phe Val Leu Ser Ser Leu Gly Phe Thr Ala
85 90 95

Val Ala Phe Val Thr Gly Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu
100 105 110

Leu Arg Ser Arg Val Val Leu Gly Glu Thr Pro Pro Cys Leu Pro Gly
115 120 125

Asp Ser Cys Ser Ser Ser Asp Ser Leu Ile Phe Gly Leu Ile Thr Cys
130 135 140

Leu Thr Gly Val Leu Gly Val Gly Leu Gly Val Glu Ile Ser Arg Arg
145 150 155 160

Leu Arg His Ser Asn Pro Arg Ala Asp Pro Leu Val Cys Ala Thr Gly
165 170 175



Leu Leu Gly Ser Ala Pro Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg
180 185 190

Gly Ser Ile Val Ala Thr Tyr Ile Phe Ile Phe Ile Gly Glu Thr Leu
195 200 205

Leu Ser Met Asn Trp Ala Ile Val Ala Asp Ile Leu Leu Tyr Val Val
210 215 220

Ile Pro Thr Arg Arg Ser Thr Ala Glu Ala Phe Gln Ile Val Leu Ser
225 230 235 240

His Leu Leu Gly Asp Ala Gly Ser Pro Tyr Leu Ile Gly Leu Ile Ser
245 250 255

Asp Arg Leu Arg Arg Asn Trp Pro Pro Ser Phe Leu Ser Glu Phe Arg
260 265 270

Ala Leu Gln Phe Ser Leu Met Leu Cys Ala Phe Val Gly Ala Leu Gly
275 280 285

Gly Ala Leu Pro Gly His Arg His Leu His Xaa
290 295



(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:

Met Gly Pro Gln Gly Trp Val Arg Pro Leu Lys Thr Ala Pro Lys Leu
1 5 10 15

Gly Glu Ala Ile Arg Leu Ile Leu Phe Leu Asn Phe Val Lys Gln Cys
20 25 30

Ile Ala Ser Val Asn Leu Cys Ile Leu Arg Leu Asn Ile Thr Pro Leu
35 40 45

Leu

(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

Met Tyr Val Asn Tyr Gly Thr Arg Asn Tyr Ser Thr Glu Gly Pro Ala
1 5 10 15

Ala Leu Leu Asp Gln Ala Lys Leu Ser Leu Leu Val Trp Val Leu Cys
20 25 30

Phe Val Leu Leu Phe Val Cys Phe Cys Gly Leu Ser Tyr Val Val Ile
35 40 45

Ala Gln Val Pro Val Gly Leu Leu Cys Ile Thr Glu Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 392:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:

Met Leu Trp Phe Ala Asn Phe Phe Thr Tyr Leu Phe Leu Ser Gln Ser
1 5 10 15

Val Ala Phe Val His Ile Ser His Ile Gly Val Arg Gln Val Asn Thr
20 25 30

Asn Cys Tyr Phe Ser Arg Lys Ser Tyr Cys Tyr Gly Ile Leu Asn Pro
35 40 45

Ile Asn Cys Ile Lys Gly Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
50 55 60

Lys Lys Lys Lys Ile Pro Ala Gly Arg Xaa Leu Phe Pro Phe Gly
65 70 75



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393:

Met Pro Gly Ala Phe Ser Glu Thr Val Ile Asn Asp Leu Leu Ser Leu
1 5 10 15

Leu Val Leu Pro Ala Glu Leu Ser Tyr Ser Thr Leu Ser Gly Val
20 25 30

Tyr Arg Asn Ala
35



(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

Met Ala Gln Ser Arg Asp Gly Gly Asn Pro Phe Ala Glu Pro Ser Glu
1 5 10 15

Leu Asp Asn Pro Phe Gln Asp Pro Ala Val Ile Gln His Arg Pro Ser
20 25 30

Arg Gln Tyr Ala Thr Leu Asp Val Tyr Asn Pro Phe Glu Thr Arg Glu
35 40 45

Pro Pro Pro Ala Tyr Glu Pro Pro Ala Pro Ala Pro Leu Pro Pro Pro
50 55 60

Ser Ala Pro Ser Leu Gln Pro Ser Arg Lys Leu Ser Pro Thr Glu Pro
65 70 75 80

Lys Asn Tyr Gly Ser Tyr Ser Tte Gln Ala Ser Ala Ala Ala Ala Thr
85 90 95

Ala Glu Leu Leu Lys Lys Gln Glu Glu Leu Asn Arg Lys Ala Glu Glu
100 105 110

Leu Asp Arg Arg Ser Glu Ser Cys Ser Met Leu Pro Trp Xaa Ala Gln
115 120 125

Leu Leu Asp Arg Thr Ile Gly Pro Leu Tyr Leu Leu Phe Val Gln Phe
130 135 140

Ser Pro Ala Phe Ser Arg Thr Ser Pro Trp Arg Ser Pro Lys Asn Phe
145 150 155 160

Arg Arg Leu Tyr Pro Pro Cys Thr Thr Ser Gly Cys Ala Ala Arg Trp
165 170 175

Xaa Phe Ser Xaa
180



(2) INFORMATION FOR SEQ ID NO: 395:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:

Met Pro Thr Pro Cys Thr Ser Leu Pro Ser Cys Cys Gln His Arg Ser
1 5 10 15

Ile Thr Met Thr Leu
20



(2) INFORMATION FOR SEQ ID NO: 396:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

Met Pro Leu Phe Ile Pro Leu Ile Phe Phe Leu Ser Lai Leu His Cys
1 5 10 15

Gln Ser Lys His Pro Ile Gln Met Ser Leu Cys Met Cys Val Asn Ile
20 25 30

Ser Leu Val Trp Ser Pro Val Arg Trp Ile Phe Gly Ser Lys Gly Leu
35 40 45

Phe Ser Val His Leu Gln Ser Ser Gln Arg Pro Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 397:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

Met Ala Gly Pro Arg Pro Xaa Trp Arg Asp Gln Leu Leu Phe Met Ser
1 5 10 15

Ile Ile Val l«i Val Ile Val Val Ile Cys Leu Met Leu Tyr Ala Leu
20 25 30

Leu Trp Glu Ala Gly Asn Leu Thr Asp Leu Pro Asn Leu Arg Ile Gly
35 40 45

Phe Tyr Asn Phe Cys Leu Trp Asn Glu Asp Thr Ser Thr Leu Gln Cys
50 55 60

His Gln Phe Pro Glu Leu Glu Ala Leu Gly Val Pro Arg Val Gly Leu
65 70 75 80

Gly Leu Ala Arg Leu Gly Val Tyr Gly Ser Leu Val Leu Thr Leu Phe
85 90 95

Ala Pro Gln Pro Leu Leu Leu Ala Gln Cys Asn Xaa Asp Glu Arg Ala
100 105 110

Trp Arg Leu Ala Val Gly Phe Leu Ala Val Ser Ser Val Leu Leu Ala
115 120 125

Gly Gly Leu Gly Leu Phe Leu Ser Tyr Val Trp Asn Gly Ser Xaa Ser
130 135 140

Pro Ser Arg Gly Leu Gly Phe Xaa
145 150

(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:

Met Ser Asp Gly Phe Asp Arg Ala Pro Gly Ala Gly Arg Gly Arg Xaa
1 5 10 15

Arg Gly Leu Gly Arg Gly Gly Gly Gly Pro Xaa Gly Gly Gly Phe Pro
20 25 30

Xaa Gly Xaa Xaa Pro Ala Glu Arg Xaa Arg His Gln Pro Pro Gln Pro
35 40 45

Lys Ala Pro Gly Phe Leu Gln Pro Xaa Pro Leu Arg Gln Pro Arg Thr
50 55 60

Thr Pro Pro Pro Gly Ala Gln Cys Glu Val Pro Ala Ser Pro Gln Arg
65 70 75 80

Ser Arg Pro Gly Ala Leu Pro Glu Glu Thr Arg Pro Leu Arg Ala
85 90 95



Pro Pro Ser Ser Gln Asp Lys Ile Pro Gln Gln Asn Ser Glu Ser Ala
100 105 110

Met Ala Lys Pro Gln Val Val Val Ala Pro Val Leu Met Ser Lys Leu
115 120 125

Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr
130 135 140

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr Pro Thr Leu Ser Glu
145 150 155 160

Tyr Val Gln Asp Phe Leu Asn His L^ Thr Glu Gln Pro Gly Ser Phe
165 170 175

Glu Thr Glu Ile Glu Gln Phe Ala Glu Thr Leu Asn Gly Cys Val Thr
180 185 190

Thr Asp Asp Ala Leu Gln Glu Leu Val Glu Leu Ile Tyr Gln Gln Ala
195 200 205

Thr Ser Ile Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr
210 215 220



Leu Ser His His Leu Thr Ile Ser Pro Gln Ser Gly Asn Phe Arg Gln
225 230 235 240

Leu Leu Leu Gln Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gln Ala
245 250 255

Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu
260 265 270

Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu Ile Lys Gly Thr Asn Gly
275 280 285

Gln Val "Wir Arg Ala Asp Ile Leu Gln Val Gly Leu Arg Glu Leu Leu
290 295 300

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu Ile Cys Ala Val
305 310 315 320

Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu
325 330 335

Lys Gly Lys Met Asp Met Glu Glu Ile Ile Gln Arg Ile Glu Asn Val
340 345 350

Val Leu Asp Ala Acti Cys Ser Arg Asp Val Lys Gln Met Leu Leu Lys
355 360 365

Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser
370 375 380

Itor Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn
385 390 395 400

Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp
405 410 415

Pro Asp Tyr Gln Glu Lys Tyr Gln Glu Leu Leu Glu Arg Glu Asp Phe
420 425 430

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp 
435 440 445 



Pro Tyr Leu Asp Asp Ile Asp Asp Glu Met Asp Pro Glu Ile Glu Glu
450 455 460

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gln Xaa
465 470 475 480



(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 423 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399:

Met Glu Pro Lys Thr Ile Thr Asp Ala Leu Ala Ser Ser Ile Ile Lys
1 5 10 15

Ser Val Leu Pro Asn Phe Leu Pro Tyr Asn Val Met Leu Tyr Ser Asp
20 25 30



Ala Pro Val Ser Glu Leu Ser Leu Glu Leu Leu Leu Leu Gln Val Val
35 40 45

Leu Pro Ala Leu Leu Glu Gln Gly His Thr Arg Gln Trp Leu Lys Gly
50 55 60

Leu Val Arg Ala Trp Thr Val Thr Ala Gly Tyr Leu Leu Asp Leu His
65 70 75 80

Ser Tyr Leu Leu Gly Asp Gln Glu Glu Asn Glu Asn Ser Ala Asn Gln
85 90 95

Gln Val Asn Asn Asn Gln His Ala Arg Asn Asn Asn Ala Ile Pro Val
100 105 110



Val Gly Glu Gly Leu His Ala Ala His Gln Ala Ile Leu Gln Gln Gly
115 120 125

Gly Pro Val Gly Phe Gln Xaa Tyr Arg Arg Pro Leu Asn Phe Pro Leu
130 135 140

Arg Ile Phe Leu Leu Ile Val Phe Met Cys Ile Thr Leu Leu Ile Ala
145 150 155 160

Ser Leu Ile Cys Leu Thr Leu Pro Val Phe Ala Gly Arg Trp Leu Met
165 170 175

Ser Phe Trp Thr Gly Thr Ala Lys Ile His Glu Leu Tyr Thr Ala Ala
180 185 190

Cys Gly Leu Tyr Val Cys Trp Leu Thr Ile Arg Ala Val Thr Val Met
195 200 205

Val Ala Trp Met Pro Gln Gly Arg Arg Val Ile Phe Gln Lys Val Lys
210 215 220



Glu Trp Ser Leu Met Ile Met Lys Thr Leu Ile Val Ala Val Leu Leu
225 230 235 240

Ala Gly Val Val Pro Leu Leu Leu Gly Leu Leu Phe Glu Leu Val Ile
245 250 255

Val Ala Pro Leu Arg Val Pro Leu Asp Gln llu: Pro Leu Phe Tyr Pro
260 265 270



Trp Gln Asp Trp Ala Leu Gly Val Leu His Ala Lys Ile Ile Ala Ala
275 280 285

Ile Tto Leu Met Gly Pro Gln Trp Trp Leu Lys Thr Val Ile Glu Gln
290 295 300

Val Tyr Ala Asn Gly Ile Arg Asn Ile Asp Leu His Tyr Ile Val Arg
305 310 315 320

Lys Leu Ala Ala Pro Val Ile Ser Val Leu Leu Leu Ser Leu Cys Val
325 330 335

Pro Tyr Val Ile Ala Ser Gly Val Val Pro Leu Leu Gly Val Thr Ala
340 345 350

Glu Met Gln Asn Leu Val His Arg Arg Ile Tyr Pro Phe Leu Leu Met
355 360 365

Val Val Val Leu Met Ala Ile Leu Ser Phe Gln Val Arg Gln Phe Lys
370 375 380

Arg Leu Tyr Glu His Ile Lys Asn Asp Lys Tyr Leu Val Gly Gln Arg
385 390 395 400

Leu Val Asn Tyr Glu Arg Lys Ser Gly Lys Gln Gly Ser Ser Pro Pro
405 410 415

Pro Pro Gln Ser Ser Gln Glu
420



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 78 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:

Met Leu Arg Leu Asp Ile Ile Asn Ser Leu Val Thr Thr Val Phe Met
1 5 10 15

Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu Thr Thr Thr Leu Thr
20 25 30

Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala
35 40 45

Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro
50 55 60

Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 401:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 74 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:
Met Leu Lys Gln Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser
1 5 10 15

His Cys Trp Gly Leu Pro Leu His Val Ala Pro Leu Cys Arg Gly His
20 25 30

Gln Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gln Gly Ala Trp
35 40 45

Asn Arg Asn Leu Ala Asn Gln Arg His Phe Cys Pro Ser Ile Phe
50 55 60

His Thr Cys Pro Thr Val Leu Phe Phe Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 402:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:

Ala Arg Thr Ile Leu Val Leu Tyr Leu Ser Leu Gln Arg Leu Glu Asn
1 5 10 15

Leu Ala Tyr His
20



(2) INFORMATION FOR SEQ ID NO: 403:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:

Met Pro Leu Pro Ser Val Pro Ile Leu Gly Ile Phe Ser Phe Leu Ile
1 5 10 15

Pro Ser Ser Gln Gly Val Ser Tyr Thr Lys Leu Pro Ile Ser Ser Pro
20 25 30

Gln Tyr Ser Pro Phe Val Asn Asp His Phe Ser Phe Leu Asn Pro Phe
35 40 45

Pro Val Gln Ile His Thr Gly Phe Ala Arg Val Gly Ser Tyr Met Gln
50 55 60

Met Pro Leu Val His Leu Cys Leu Leu Gln Thr Ser Leu Met Lys Asn
65 70 75 80

Ser Gly Val Gln Gln Gly Ser
85
(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404:

Met Asn Ala Ala Met Val His Ile Asn Arg Ala Leu Lys Leu Ile Ile
1 5 10 15

Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala Val
20 25 30

Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly Ile Thr
35 40 45

Leu Leu Ile Leu Ala Glu Leu Leu Ile Phe Ser Val Pro Ile Val Tyr
50 55 60

Glu Lys Tyr Lys Thr Gln Ile Asp His Tyr Val Gly Ile Ala Arg Asp
65 70 75 80

Gln Thr Lys Ser Ile Val Glu Lys Ile Pro Ser Lys
85 90



(2) INFORMATION FOR SEQ ID NO: 405: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:

Met Ala Cys Ser Cys Leu Met Ile Gln Ser Phe Ser Thr Ser Ala Leu
1 5 10 15

Val Leu Phe Tyr Gly
20



(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 174 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:

Met Glu Glu Gly Gly Asn Leu Gly Gly Leu Ile Lys Met Val His Leu
1 5 10 15

Leu Val Leu Ser Gly Ala Trp Gly Met Gln Met Trp Val Thr Phe Val
20 25 30

Ser Gly Phe Pro Ala Phe Pro Lys Pro Ser Pro Thr Tyr Leu Arg Thr
35 40 45

Ser Ala Glu Gln Thr Leu Pro Leu Leu Leu Pro His Leu His Gly Leu
50 55 60

Cys Leu His Gln Pro Leu His Leu Gly Phe Thr Ala Cys Leu Gly Ser
65 70 75 80

Ala His Ile Leu Gly Gly Gln Pro Ala Leu Pro Ala Val Pro Glu Pro
85 90 95

Tyr Ala Gly His Cys Gln Arg Pro Leu Ala Gly Thr Pro His His Ser
100 105 110

Cys His Val Gly Pro Ala Asn Arg Gly Arg Arg Ser Glu Ala Trp Val
115 120 125

Gly Arg Tyr Gln Ala Ala Asn Arg Phe Pro Ile Leu Asn Ala Xaa Cys
130 135 140

Glu Arg Arg Thr Pro Ser Thr Val Leu Ser Ala Arg Ile Ser Ser Ala
145 150 155 160

Thr Met Gly Cys Pro Leu Phe Ala Ile Trp Ala Ala Ser Xaa
165 170

(2) INFORMATION FOR SEQ ID NO: 407:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:

Met Ala Phe Ile Leu Leu Phe Tyr Cys Leu Met Thr Phe Leu Ser Leu
1 5 10 15

Glu Gln Asn Ser Ala Thr Val Glu Pro Ser Ser His Glu Ile Leu His
20 25 30

Leu Leu Gln Asn Cys Phe Glu Leu Leu Arg Thr Ser Thr Ser Gln Cys
35 40 45

Thr Glu Gly Ile Pro Cys Glh Arg Tyr Gln Asn Gly Leu His Ile Xaa
50 55 60

(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 280 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:

Met Glu Ala Val Val Asn Leu Tyr Gln Glu Val Met Lys His Ala Asp
1 5 10 15

Pro Arg Ile Gln Gly Tyr Pro Leu Met Gly Ser Pro L^ Leu Met Thr
20 25 30

Ser Ile Leu Leu Thr Tyr Val Tyr Phe Val Leu Ser Leu Gly Pro Arg
35 40 45

Ile Met Ala Asn Arg Lys Pro Phe Gln Leu Arg Gly Phe Met Ile Val
50 55 60

Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr Ile Val Tyr Glu Phe
65 70 75 80



Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp Arg Cys Asp Pro Val
85 90 95

Asp Tyr Ser Asn Ser Pro Glu Ala Leu Arg Met Val Arg Val Ala Trp
100 105 110

Leu Phe Leu Phe Ser Lys Phe Ile Glu Leu Met Asp Thr Val Ile Phe
115 120 125

Ile Leu Arg Lys Lys Asp Gly Gln Val Thr Phe Leu His Val Phe His
130 135 140

His Ser Val Leu Pro Trp Ser Trp Trp Trp Gly Val Lys Ile Ala Pro
145 150 155 160

Gly Gly Met Gly Ser Phe His Ala Met Ile Asn Ser Ser Val His Val
165 170 175

Ile Met Tyr Leu Tyr Tyr Gly Leu Ser Ala Phe Gly Pro Val Ala Gln
180 185 190

Pro Tyr Leu Trp Trp Lys Lys His Met Thr Ala Ile Gln Leu Ile Gln
195 200 205



Phe Val Leu Val Ser Leu His Ile Ser Gln Tyr Tyr Phe Met Ser Ser
210 215 220

Cys Asn Tyr Gln Tyr Pro Val Ile Ile His Leu Ile Trp Met Tyr Gly
225 230 235 240

Thr Ile Phe Phe Met Leu Phe Ser Asn Phe Trp Tyr His Ser Tyr Thr
245 250 255

Lys Gly Lys Arg Leu Pro Arg Ala Leu Gln Gln Asn Gly Ala Pro Gly
260 265 270

Ile Ala Lys Val Lys Ala Asn Xaa
275 280



(2) INFORMATION FOR SEQ ID NO: 409:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

Met Xaa Leu Trp Pro Gln Thr Cys Ser Gly Lys Phe Asp Gly Thr Leu
1 5 10 15

Ala Phe Ser Ile His Xaa Leu Ala Val Ile Leu Gly Asp Gln Leu Thr
20 25 30

Ala Ala Asp Leu Val Pro Ile Phe Asn Gly Phe Leu Lys Asp Leu Asp
35 40 45

Glu Val Arg Ile Gly Val Leu Lys His Leu His Asp Phe Leu Lys Leu
50 55 60

Leu His Ile Asp Lys Arg Arg Glu Tyr Leu Tyr Gln Leu Gln Glu Phe
65 70 75 80

Leu Val Thr Asp Asn Ser Arg Asn Trp Arg Arg Ala Glu Leu Ala
85 90 95

Glu Gln Leu Ile Leu Leu Leu Glu Leu Tyr Ser Pro Arg Asp Val Tyr
100 105 110

Asp Tyr Leu Arg Pro Ile Ala Leu Asn Leu Cys Ala Asp Lys Val Ser
115 120 125

Ser Val Arg Trp Ile Ser Tyr Lys Leu Val Ser Glu Met Val Lys Lys
130 135 140

Leu His Ala Ala Thr Pro Pro Thr Phe Gly Val Asp Leu Ile Asn Glu
145 150 155 160

Leu Val Glu Asn Phe Gly Arg Cys Pro Lys Trp Ser Gly Arg Gln Ala
165 170 175

Phe Val Phe Val Cys Gln Thr Val Ile Glu Asp Asp Cys Leu Pro Met
180 185 190

Asp Gln Phe Ala Val His Leu Met Pro His Leu Leu Thr Leu Ala Asn
195 200 205

Asp Arg Val Pro Asn Val Arg Val Leu Leu Ala Lys Thr Leu Arg Gln
210 215 220

Thr Leu Leu Glu Lys Asp Tyr Phe Leu Ala Ser Ala Ser Cys His Gln
225 230 235 240

Glu Ala Val Glu Gln Thr Ile Met Ala Leu Gln Met Asp Arg Asp Ser
245 250 255

Asp Val Lys Tyr Phe Ala Ser Ile His Pro Ala Ser Thr Lys Ile Ser
260 265 270



Glu Asp Ala Met Ser Ito Ala Ser Ser Thr Tyr Xaa
275 280 
(2) INFORMATION FOR SEQ ID NO: 410:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

Met Leu Phe Leu Phe Phe Val Ile Ile Phe Leu Phe Val Phe Leu Ile
1 5 10 15

Leu Ile Ile Gln Phe Ser Lys Pro Leu Thr Asn Pro His Pro Pro Ala
20 25 30

Gly Xaa Ser Asp Arg Arg Arg Arg Tyr Ser Ser Tyr Arg Ser His Asp
35 40 45

His Tyr Gln Arg Gln Arg Val Leu Gln Lys Glu Arg Ala Ile Glu Glu
50 55 60

Arg Arg Val Val Phe Ile Gly Lys Ile Pro Gly Arg Met Thr Arg Ser
65 70 75 80

Glu Leu Lys Gln Arg Phe Ser Val Phe Gly Glu Ile Glu Glu Cys Thr
85 90 95

Ile His Phe Arg Val Gln Gly Asp Asn Tyr Gly Phe Val Thr Tyr Arg
100 105 110

Tyr Ala Glu Glu Ala Phe Ala Ala Ile Glu Ser Gly His Lys Leu Arg
115 120 125

Gln Ala Asp Glu Gln Pro Phe Asp Leu Cys Phe Gly Gly Arg Arg Xaa
130 135 140

Xaa Cys Lys Arg Ser Tyr Ser Asp Leu Asp Ser Asn Arg Glu Asp Phe
145 150 155 160

Asp Pro Ala Pro Val Lys Ser Lys Phe Asp Ser Leu Asp Phe Asp Thr
165 170 175

Leu Leu Lys Gln Ala Gln Lys Asn Leu Arg Arg
180 185



(2) INFORMATION FOR SEQ ID NO: 411:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:



Met Lys Leu Pro Gly Lys Phe Arg Arg Ala His Gln Gly Asn Leu Glu
1 5 10 15

Ser Gln Leu Thr Ser Glu Ser Tyr Tyr Lys Glu liu: Leu Ser Val Pro
20 25 30

Thr Val Glu His Ile Ile Gln Glu Leu Lys Asp Ile Phe Ser Glu Gln
35 40 45

His Leu Lys Ala Leu Lys Cys Leu Ser Leu Val Pro Ser Val Met Gly
50 55 60

Gln Leu Lys Phe Asn Thr Ser Glu Glu His His Ala Asp Met Tyr Arg
65 70 75 80

Ser Asp Leu Pro Asn Pro Asp Thr Leu Ser Ala Glu Leu His Cys Trp
85 90 95

Arg Ile Lys Trp Lys His Arg Gly Lys Asp Ile Glu Leu Pro Ser Thr
100 105 110

Ile Tyr Glu Ala Leu His Leu Pro Asp Ile Lys Phe Phe Pro Asn Val
115 120 125

Tyr Ala Leu Leu Lys Val Leu Cys Ile Leu Pro Val Met Lys Val Glu
130 135 140

Asn Glu Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg
145 150 155 160



Asn Thr Leu Thr Asp Gln Arg Ser Ser Asn Leu Ala Leu Leu Asn Ile
165 170 175

Asn Phe Asp Ile Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr Ile
180 185 190

Tyr Thr Xaa Xaa Ser Xaa Leu Xaa Thr Xaa Xaa Ser Xaa Xaa
195 200 205

Val Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa
210 215 220

Asp Xaa Xaa Xaa Arg Glu Lys Ala Val Arg Cys Met Xaa
225 230 235



(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 192 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

Met Lys Pro Met Ala Val Val Ala Ser Thr Val Leu Gly Leu Val Gln
1 5 10 15

Asn Met Arg Ala Phe Gly Gly Ile Leu Val Val Val Tyr Tyr Val Phe
20 25 30

Ala Ile Ile Gly Ile Asn Leu Phe Arg Gly Val Ile Val Ala Leu Pro
35 40 45

Gly Asn Ser Ser Leu Ala Pro Ala Asn Gly Ser Ala Pro Cys Gly Ser
50 55 60

Phe Glu Gln Leu Glu ovr Trp Ala Asn Asn Phe Asp Asp Phe Ala Ala
65 70 75 80

Ala Leu Val Thr Leu Trp Asn Leu Met Val Val Asn Asn Trp Gln Val
85 90 95

Phe Leu Asp Ala Tyr Arg Arg Tyr Ser Gly Pro Trp Ser Lys Ile Tyr
100 105 110

Phe Val Leu Trp Trp Leu Val Ser Ser Val Ile Trp Val Asn Leu Phe
115 120 125

Leu Ala Leu Ile Leu Glu Asn Phe Leu His Lys Trp Asp Pro Arg Ser
130 135 140

His Leu Gln Pro Leu Ala Gly Thr Pro Glu Ala Thr Tyr Gln Met Thr
145 150 155 160

Val Glu Leu Leu Phe Arg Asp Ile Leu Glu Glu Pro Gly Glu Asp Glu
165 170 175

Leu Thr Glu Arg Leu Ser Gln His Pro His Leu Trp Leu Cys Arg Xaa
180 185 190






(2) INFORMATION FOR SEQ ID NO: 413:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:

Asn Val Val Val Val Ala Phe Gly Leu Ile Leu Ile Ile Glu Ser Leu
1 5 10 15

Gly Glu Gln Cys Pro
20



(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

Met Asn Trp Gly Leu Ser Ile Trp Leu His Tyr Tyr Glu Lys Lys Lys
1 5 10 15

Glu Gln Val Phe Leu Val Ile Leu Ala His Val Val Arg Arg Cys Ala
20 25 30

Ser Asp Gly Ile Leu Gln Phe Glu Ser Ser Leu Leu Lys Met Arg Arg
35 40 45

Ala Pro Xaa
50

(2) INFORMATION FOR SEQ ID NO: 415:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:

Met Leu Ile Ile Ser Leu Arg Pro Gln Phe Pro Ser Leu Ile Val Gln
1 5 10 15

Leu Glu Cys Ser Val Leu Phe Leu Pro Ile Ser Leu Asn Leu Leu Leu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

Met Val Lys Val Cys Asn Asp Ser Asp Arg Trp Ser Leu Ile Ser Leu
1 5 10 15

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu
20 25 30

Arg Arg Gln Phe Glu Phe Ser Val Asp Ser Phe Gln Ile Lys Leu Asp
35 40 45

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Ifttr
50 55 60

Phe His Pro Thr Ile Ile Gly Glu Ser Val Tyr Gly Asp Phe Gln Glu
65 70 75 80

Ala Phe Asp His Leu Cys Asn Lys Ile Ile Ala Thr Arg Asn Pro Glu
85 90 95

Glu Ile Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg
100 105 110



Gly Phe Arg Pro Ala Ser Asp Glu Ile Lys Thr Leu Gln Arg Tyr Met
115 120 125

Cys Ser Arg Phe Phe Ile Asp Phe Ser Asp Ile Gly Glu Gln Gln Arg
130 135 140

Lys Glu Ser Tyr Leu Gln Asn His Phe Val Gly Ile Gly Arg Pro
145 150 155 160

Gln Val Xaa



(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 174 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gln Ile Phe
1 5 10 15

Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg Ile Ile Leu Gly
20 25 30

Ala Asn Ala Ile Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser
35 40 45



Ala Ser Phe Trp Ala Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr
50 55 60

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Ser
65 70 75 80

Glu Asp Gly Ala Leu Met Asp Gly Gly Met Asp Leu Asn Met Glu Gln
85 90 95

Gly Met Ala Glu His Leu Lys Asp Val Ile Leu Leu Thr Ala Ile Val
100 105 110

Gln Val Leu Ser Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu
115 120 125

Ala Pro Gly Arg Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro
130 135 140

Trp Phe Thr Ala Asp Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys
145 150 155 160

Arg Gln Arg Arg Gln Glu Arg Arg Gln Met Lys Arg Leu Xaa
165 170

(2) INFORMATION FOR SEQ ID NO: 418:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:

Met Glu Leu Pro Lys Gly Leu Gln Gly Val Gly Pro Val Ala Met Met
1 5 10 15

Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gln Ala Leu Arg
20 25 30

Gln Ser Gln Gly Lys Ser Pro Leu Leu Trp Lys Arg Thr Cys Cys Leu
35 40 45

Ala Xaa
50



(2) INFORMATION FOR SEQ ID NO: 419:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 120 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

Met Leu Gly Lys Gly Gly Gly Arg Ala Gly Leu Leu Arg Tyr Arg Leu
1 5 10 15

Leu Tyr Phe Thr Leu Val Val Gly Glu Gly Glu Pro Gly Glu Asn Lys
20 25 30

Val Thr Ile Pro Phe Phe Glu Thr Gly Lys Lys Ile Ile Phe Cys Ser
35 40 45

Val Lys Met Val Glu Asn Ser Asn Val Pro Ser His Lys Gly Pro Val
50 55 60

Pro Leu Arg Ser Glu Gln Trp Glu Leu Lys Ile Ser Glu Thr Leu Gly
65 70 75 80

Glu Gly Lys Ile Gly Phe Leu Leu Ile Gly Arg Cys Ser Ser Gly Xaa
85 90 95

Gly Gly Leu Cys Phe Cys Trp Asp Val Leu Cys Cys Met Tyr Ala Tyr
100 105 110

Met Asp Arg Ser Leu Leu Ser Leu
115 120



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

Met Thr His Leu Leu Leu Thr Ala Thr Val Thr Pro Ser Glu Gln Asn
1 5 10 15

Ser Ser Arg Glu Pro Gly Trp Glu Tte Ala Met Ala Lys Asp Ile Leu
20 25 30

Gly Glu Ala Gly Leu His Phe Asp Glu Leu Asn Lys Leu Arg Val Leu
35 40 45

Asp Pro Glu Val Thr Gln Gln Thr Ile Glu Leu Lys Glu Glu Cys Lys
50 55 60



Asp Phe Val Asp Lys Ile Gly Gln Phe Gln Lys Ile Val Gly Gly Leu
65 70 75 80

Ile Glu Leu Val Asp Gln Leu Ala Lys Glu Ala Glu Asn Glu Lys Met
85 90 95

Lys Ala Ile Gly Ala Arg Asn Leu Leu Lys Ser Ile Ala Lys Gln Arg
100 105 110

Glu Ala Gln Gln Gln Gln Leu Gln Ala Leu Ile Ala Glu Lys Lys Met
115 120 125

Gln Leu Glu Arg Tyr Arg Val Glu Tyr Glu Ala Leu Cys Lys Val Glu
130 135 140



Ala Glu Gln Asn Glu Phe Ile Asp Gln Phe Ile Phe Gln Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 421:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 154 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:

Met Asn Val Gly Val Ala His Ser Glu Val Asn Pro Asn Thr Arg Val
1 5 10 15

Met Asn Ser Arg Gly Met Trp Leu Otur Tyr Ala Leu Gly Val Gly Leu
20 25 30

Leu His Ile Val Leu Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala
35 40 45

Trp Thr Leu Thr Asn Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu
50 55 60

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala
65 70 75 80

Arg Leu Leu Thr His Trp Glu Gln Leu Asp Tyr Gly Val Gln Phe Thr
85 90 95

Ser Ser Arg Lys Phe Phe Thr Ile Ser Pro Ile Ile Leu Tyr Phe Leu
100 105 110

Ala Ser Phe Tyr Thr Lys Tyr Asp Pro Thr His Phe Ile Leu Asn Thr
115 120 125

Ala Ser Leu Leu Ser Val Leu Ile Pro Lys Met Pro Gln Leu His Gly
130 135 140

Val Arg Ile Phe Gly Ile Asn Lys Tyr Xaa
145 150



(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:

Met Val Cys Gly Gly Phe Ala Cys Ser Lys Asn Cys Leu Cys Ala Leu
1 5 10 15

Asn Leu Leu Tyr Thr Leu Val Ser Leu Leu Leu Ile Gly Ile Ala Ala
20 25 30

Trp Gly Ile Gly Phe Gly Leu Ile Ser Ser Leu Arg Val Val Gly Val
35 40 45

Val Ile Ala Val Gly Ile Phe Leu Phe Leu Ile Ala Leu Val Gly Leu
50 55 60



Ile Gly Ala Val Lys His His Gln Val Leu Leu Phe Phe Tyr Met Ile
65 70 75 80

Ile Leu Leu Leu Val Phe Ile Val Gln Phe Ser Val Ser Cys Ala Cys
85 90 95

Leu Ala Leu Asn Gln Glu Gln Gln Gly Gln Leu Leu Glu Val Gly Trp
100 105 110

Asn Asn Thr Ala Ser Ala Arg Asn Asp Ile Gln Arg Asn Leu Asn Cys
115 120 125

Cys Gly Phe Arg Ser Val Asn Pro Asn Asp Thr Cys Leu Ala Ser Cys
130 135 140



Val Lys Ser Asp His Ser Cys Ser Pro Cys Ala Pro Ile Ile Gly Glu
145 150 155 160

Tyr Ala Gly Glu Val Leu Arg Phe Val Gly Gly Ile Gly Leu Phe Phe
165 170 175

Ser Phe Thr Glu Ile Leu Gly Val Trp Leu Thr Tyr Arg Tyr Arg Asn
180 185 190

Gln Lys Asp Pro Arg Ala Asn Pro Ser Ala Phe Leu
195 200
(2) INFORMATION FOR SEQ ID NO: 423:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 67 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

Met Leu Gln Ser Ile Ile Lys Asn Ile Trp Ile Pro Met Lys Pro Tyr
1 5 10 15

Tyr Thr Lys Val Tyr Gln Glu Ile Trp Ile Gly Met Gly Leu Met Gly
20 25 30

Phe Ile Val Tyr Lys Ile Arg Ala Ala Asp Lys Arg Ser Lys Ala Leu
35 40 45

Lys Ala Ser Ala Pro Ala Pro Gly His His Asn Gln Ile Tyr Leu Glu
50 55 60

Tyr Met Xaa
65



(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Met Gly Val Ser Leu Phe Leu Val Val Leu Tyr His Tyr Val
1 5 10 15

Ala Val Asn Asn Pro Lys Lys Gln Glu
20 25



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 299 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:



Met Ala Ala Xaa Glu Pro Ala Val Leu Ala Leu Pro Asn Ser Gly Ala
1 5 10 15

Gly Gly Ala Gly Ala Pro Ser Gly Thr Val Pro Val Leu Phe Cys Phe
20 25 30

Ser Val Phe Ala Arg Pro Ser Ser Val Pro His Gly Ala Gly Tyr Glu
35 40 45

Ile Gln Lys Phe Ser Leu Tyr Gly Asp Gln Ile Asp Met
50 55 60

His Arg Lys Phe Val Val Gln Phe Ala Glu Glu Trp Gly Gln 1^
65 70 75 80

Val Asp Leu Pro Lys Gly Phe Ala Val Ser Glu Arg Cys Lys Val Arg
85 90 95

Leu Val Pro Gln Ile Gln Leu «^ ohr Leu Gly Asn Leu Thr Pro
100 105 110

Ser Ser Thr Val Phe Phe Cys Cys Asp Met Gln Glu Arg Phe Arg Pro
115 120 125

Ala Ile Lys Tyr Phe Gly Asp Ile Ile Ser Val Gly Gln Arg Leu Leu
130 135 140

Gln Gly Ala Arg Ile Gly Ile Pro Val Ile Val «„: Glu Gln Tyr
145 150 155 160

Pro Lys Gly Leu Gly Ser Thr Val Gln Glu Ile Asp Leu Itu: Gly Val
165 170 175

Lys Leu Val Leu Pro Lys Thr Lys Phe Ser Met Val Leu Pro Glu Val
180 185 190

Glu Ala Ala Leu Ala Glu Ile Pro Gly Val Arg Ser Val Val Leu Phe
195 200 205

Gly Val Glu Thr His Val Cys Ile Gln Gln Thr Ala Leu Glu Leu Val
210 215 220



Gly Arg Gly Val Glu Val His Ile Val Ala Asp Ala Thr Ser Ser Arg
225 230 235 240

Ser Met Met Asp Arg Met Phe Ala Leu Glu Arg Leu Ala Xaa Xaa Gly
245 250 255

Ile Ile Val Thr Thr Ser Glu Ala Val Leu Leu Gln Leu Val Ala Asp
260 265 270

Lys Asp His Pro Lys Phe Lys Glu Ile Gln Asn Leu Ile Lys Ala Ser
275 280 285

Ala Pro Glu Ser Gly Leu Leu Ser Lys Val Xaa
290 295



(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

Met Arg Asp Leu Gly Thr Leu Leu Ser Pro Val Cys Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 198 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

Met Phe Gly Cys Leu Val Ala Gly Arg Leu Val Gln Thr Ala Ala Gln
1 5 10 15

Gln Val Ala Glu Asp Lys Phe Val Phe Asp Leu Pro Asp Tyr Glu Ser
20 25 30

Ile Asn His Val Val Val Phe Met Leu Gly Thr Ile Pro Phe Pro Glu
35 40 45

Gly Met Gly Gly Ser Val Tyr Phe Ser Tyr Pro Asp Ser Asn Gly Met
50 55 60

Pro Val Trp Gln Leu Leu Gly Phe Val Thr Asn Gly Lys Pro Ser Ala
65 70 75 80

Ile Phe Lys Ile Ser Gly Leu Lys Ser Gly Glu Gly Ser Gln His Pro
85 90 95

Phe Gly Ala Met Asn Ile Val Arg Thr Pro Ser Val Ala Gln Ile Gly
100 105 110

Ile Ser Val Glu Leu Leu Asp Ser Met Ala Gln Gln Thr Pro Val Gly
115 120 125

Asn Ala Ala Val Ser Ser Val Asp Ser Phe Thr Gln Phe Thr Gln Lys
130 135 140

Met Leu Asp Asn Phe Tyr Asn Phe Ala Ser Ser Phe Ala Val Ser Gln
145 150 155 160

Ala Gln Met Thr Pro Ser Pro Ser Glu Met Phe Ile Pro Ala Asn Val
165 170 175

Val Leu Lys Trp Tyr Glu Asn Phe Gln Arg Arg Leu Ala Gln Asn Pro
180 185 190

Xaa Phe Trp Xaa Thr Xaa
195



(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:

Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser
1 5 10 15

Ala Gly Val Asn Phe Ile Leu Ala Leu Pro Leu Leu Leu Leu Trp Lys
20 25 30

Asn Arg Gly Gly Val Gly Arg Ser Val Met Ser Ala Val Glu Xaa
35 40 45

(2) INFORMATION FOR SEQ ID NO: 429:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 370 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:



Met Lys Lys Val Glu Glu Lys Arg Val Asp Val Asn Ser Ala Val Ala
1 5 10 15

Met Gly Glu Val Ile Leu Ala Val Cys His Pro Asp Cys Ile Thr Thr
20 25 30

Ile Lys His Trp Ile Thr Ile Ile Arg Ala Arg Phe Glu Glu Val Leu
35 40 45

Thr Trp Ala Lys Gln His Gln Gln Arg Leu Glu Thr Ala Leu Ser Glu
50 55 60

Leu Val Ala Asn Ala Glu Leu Leu Glu Glu Leu Leu Ala Trp Ile Gln
65 70 75 80

Trp Ala Glu Thr Thr Leu Ile Gln Arg Asp Gln Glu Pro Ile Pro Gln
85 90 95

Asn Ile Asp Arg Val Lys Ala Leu Ile Ala Glu His Gln Thr Phe Met
100 105 110

Glu Glu Met Thr Arg Lys Gln Pro Asp Val Asp Arg Val Thr Lys Thr
115 120 125



Tyr Lys Arg Lys Asn Ile Glu Pro Thr His Ala Pro Phe Ile Glu Lys
130 135 140

Ser Arg Ser Gly Gly Arg Lys Ser Leu Ser Gln Pro Thr Pro Pro Pro
145 150 155 160

Met Pro Ile Leu Ser Gln Ser Glu Ala Lys Asn Pro Arg Ile Asn Gln
165 170 175

Leu Ser Ala Arg Trp Gln Gln Val Trp Leu Leu Ala Leu Glu Arg Gln
180 185 190

Arg Lys Leu Asn Asp Ala Leu Asp Arg Leu Glu Glu Leu Lys Glu Phe
195 200 205

Ala Asn Phe Asp Phe Asp Val Trp Arg Lys Lys Tyr Met Arg Trp Met
210 215 220

Asn His Lys Lys Ser Arg Val Met Asp Phe Phe Arg Arg Ile Asp Lys
225 230 235 240

Asp Gln Asp Gly Lys Ile Thr Arg Gln Glu Phe Ile Asp Gly Ile Leu
245 250 255

Ala Ser Lys Phe Pro Thr Thr Lys Leu Glu Met Thr Ala Val Ala Asp
260 265 270

Ile Phe Asp Arg Asp Gly Asp Gly Tyr Ile Asp Tyr Tyr Glu Phe Val
275 280 285



Ala Ala Leu His Pro Asn Lys Asp Ala Tyr Arg Pro Thr Thr Asp Ala
290 295 300

Asp Lys Ile Glu Asp Glu Val Thr Arg Gln Val Ala Gln Cys Lys Cys
305 310 315 320

Ala Lys Arg Phe Gln Val Glu Gln Ile Gly Glu Asn Lys Tyr Arg Phe
325 330 335

Phe Leu Gly Asn Gln Phe Gly Asp Ser Gln Gln Leu Arg Leu Val Arg
340 345 350

Ile Leu Arg Asn Arg Asp Gly Ser Arg Trp Trp Arg Met Asp Gly Leu
355 360 365

Gly Xaa
370



(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

Met Asn Val Lys Thr Phe Ser Xaa Asp His Met His Phe Leu Cys Cys
1 5 10 15

Leu Tyr Leu Arg Tyr Val Ttu: Phe Val Tyr Leu Asn Leu Phe
20 25 30



(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:

Met Glu Pro His Leu Arg Cys Arg Val Thr Arg Val Arg Gly Ser Leu
1 5 10 15

Gly Asn Thr Gly Arg Trp Leu Leu
20



(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:

Met His Tyr Leu Val Leu Gly Gly Leu Gly Val Phe Leu Phe Phe Ser
1 5 10 15

Cys Phe Val Phe Leu Phe Phe Xaa Phe Ser Phe Ala Phe Phe Pro Phe
20 25 30

Tyr Leu Glu Gly Met Gly Gly Ser Gly Asn Arg Glu Val Gly Gly Gly
35 40 45

Phe Cys Leu Phe Phe
50



(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

Met Val Ser Lys Ala Leu Leu Arg Leu Val Ser Ala Val Asn Arg Arg
1 5 10 15

Arg Met Lys Leu Leu Leu Gly Ile Ala Leu Leu Ala Tyr Val Ala Ser
20 25 30

Val Trp Gly Asn Phe Val Asn Met Arg Ser Ile Gln Glu Asn Gly Glu
35 40 45

Leu Lys Ile Glu Ser Lys Ile Glu Glu Met Val Glu Pro Leu Arg Glu
50 55 60

Lys Ile Arg Asp Leu Glu Lys Ser Phe Thr Gln Lys Tyr Pro Pro Val
65 70 75 80

Lys Phe Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Ile Thr Gly Gly
85 90 95

Ala Gly Phe Val Gly Ser His Leu Thr Asp Lys Leu Met Met Asp Gly
100 105 110

His Glu Val Thr Val Val Asp Asn Phe Phe Thr Gly Arg Lys Arg Asn
115 120 125

Val Glu His Trp Ile Gly His Glu Asn Phe Glu Leu Ile Asn His Asp
130 135 140

Val Trp Ser Pro Ser Thr Ser Arg Leu Thr Arg Tyr Thr Ile Trp His
145 150 155 160

Leu Gln Pro Pro Leu Gln Thr Thr Cys Ile Ile Leu Ser Arg His Xaa
165 170 175



(2) INFORMATION FOR SEQ ID NO: 434:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

Met Leu Arg Cys Trp Pro Leu Phe Trp Leu Pro Leu Val Ser Pro Phe
1 5 10 15

Cys Ser Leu Phe Trp Leu Leu Val Glu Trp Phe Gly Thr Asn Ile Asp
20 25 30

Arg Glu Ser Tyr Asp Ala Ile Gly Gly Pro Ser Trp Met Thr Ala Ser
35 40 45

Ser Phe Cys Leu Ser Asn Ser Asn Ile Trp Ser Leu Glu Ile Ser Ser
50 55 60

Gly Ser Thr Ser Val Val His Ser Gln Gln Ala Met Asp
65 70 75



(2) INFORMATION FOR SEQ ID NO: 435:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:



Met Arg Ser Cys Glu Ile Gln Leu Cys Val Trp Leu Leu Val Ser Ser
1 5 10 15

His Val Asp Met Val Leu Gly Gly Ser Pro Ser Thr Leu Tyr Met Met 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:

Met Val Val Asn Ser Leu Cys Phe Leu Ser Leu Leu Leu Val Ile Leu
1 5 10 15

Glu Ser Thr Asp Ser Ser Ala Arg Leu Leu Tyr His Glu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 437: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 



Met Asp Lys Gln Lys His Leu Glu Val Arg Arg Ser Val Phe Lys Ile
1 5 10 15

Gln Gly Lys Ile Ala Phe Ser Leu Met Phe Val Leu Lys Asp Leu Ser
20 25 30

Pro Thr Ile Phe Ser His Ser Ile Leu Leu Leu Leu Pro His His Val
35 40 45

Leu Pro Cys Thr Pro Gln Met Val Arg Gly Val Thr Gln Val Leu Arg
50 55 60

Glu Phe Gly Asp Gln
65



(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:

Met Pro Leu Cys Phe Phe Ser Phe Leu Cys Cys Trp Val Leu Val Phe
1 5 10 15

Lys Leu Ile



(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

Met Lys Phe Ser Leu Val Leu Leu Ile Lys Ile Ile Ser Phe Glu Arg
1 5 10 15

Leu Leu Ile Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn Ile Trp Arg
20 25 30

Arg Val Met Val Asn Leu Asn Ile Leu Phe Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 

Met Leu Leu Phe Pro Ser Leu Leu Phe Ala Ala Thr Tyr Asn Val Ala
1 5 10 15

Asn Pro Ser Arg Leu Ile Leu Tyr Met Ile Ser Ala Gly Ala Asp Ser
20 25 30

Gln



(2) INFORMATION FOR SEQ ID NO: 441:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

Met Trp Gln Val Arg Gly Leu Pro Pro Val Pro Leu Leu Leu Thr Met
1 5 10 15

Ser Pro Pro Pro Cys Leu Ser Ser Pro Phe Pro Phe Ile Ser Val Pro
20 25 30

Leu Phe Glu Ala Val Pro Ile Ser Val Ser Asp Gln Pro Ser Pro Xaa
35 40 45

Leu Thr Thr Leu Leu
50



(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:

Met Ile Thr Ser Val Leu Val Phe Leu Ile Phe Phe Phe Pro Tyr Leu
1 5 10 15

Ser Leu Val Hu: Leu l«i Gln Ala Arg Asn Leu Trp Val Ile His Arg
20 25 30

Ala Ala Leu Cys Glu Ser Gly Leu Phe His Trp Arg Lys Gly Ile Glu
35 40 45

Asn Gln Leu Glu Pro Met Tyr Phe Leu Pro His Gly Thr Leu Phe Leu
50 55 60



(2) INFORMATION FOR SEQ ID NO: 443:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:

Met Leu Tyr Ser Cys Glu Pro Tyr Leu Ile Ile Leu Asn Ile Tyr Ser
1 5 10 15

Gln Lys Ala Phe Tyr Phe Tyr Phe I*e Glu Gly Ser Phe Ser Val Cys
20 25 30

Thr Leu



(2) INFORMATION FOR SEQ ID NO: 444:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:

Met Arg Gln Arg Gln Ala Ala Cys Gln Pro Pro Pro Ser Arg Asn Gly
1 5 10 15

Leu Ala Gln Glu Cys Pro Pro His Ile Pro Ser Ser Phe Phe Leu Val
20 25 30

Lys Leu Leu Phe Ile Pro Trp Leu Ala Ser Leu Leu Ser Ser Pro Leu
35 40 45

Asn Leu Leu Leu Leu Val Ser Ile Ser Trp Asp Leu Gly Leu Lys Leu
50 55 60

Asn Leu Gln Gln Cys Arg Gln His Gln Val Leu Gln Glu Lys Asn Thr
65 70 75 80

Lys Lys Phe Asn Lys Lys Lys Lys Lys
85



(2) INFORMATION FOR SEQ ID NO: 445:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 350 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

Met Asp Phe Ile Thr Ser Thr Ala Ile Leu Pro Leu Leu Phe Gly Cys
1 5 10 15

Leu Gly Val Phe Gly Leu Phe Arg Leu Leu Gln Trp Val Arg Gly Lys
20 25 30

Ala Tyr Leu Arg Asn Ala Val Val Val Ile Thr Gly Ala Thr Ser Gly
35 40 45

Leu Gly Lys Glu Cys Ala Lys Val Phe Tyr Ala Ala Gly Ala Lys Leu
50 55 60

Val Leu Cys Gly Arg Asn Gly Gly Ala Leu Glu Glu Leu Ile Arg Glu
65 70 75 80

Leu Thr Ala Ser His Ala Thr Lys Val Gln Thr His Lys Pro Tyr Leu
85 90 95

Val Thr Phe Asp Leu Thr Asp Ser Gly Ala Ile Val Ala Ala Ala Ala
100 105 110

Glu Ile Leu Gln Cys Phe Gly Tyr Val Asp Ile Leu Val Asn Asn Ala
115 120 125

Gly Ile Ser Tyr Arg Gly Thr Ile Met Asp Thr Thr Val Asp Val Asp
130 135 140

Lys Arg Val Met Glu Thr Asn Tyr Phe Gly Pro Val Ala Leu Thr Lys
145 150 155 160

Ala Leu Leu Pro Ser Met Ile Lys Arg Arg Gln Gly His Ile Val Ala
165 170 175

Ile Ser Ser Ile Gln Gly Lys Met Ser Ile Pro Phe Arg Ser Ala Tyr
180 185 190

Ala Ala Ser Lys His Ala Thr Gln Ala Phe Phe Asp Cys Leu Arg Ala
195 200 205

Glu Met Glu Gln Tyr Glu Ile Glu Val Thr Val Ile Ser Pro Gly Tyr
210 215 220

Ile His Thr Asn Leu Ser Val Asn Ala Ile Thr Ala Asp Gly Ser Arg
225 230 235 240

Tyr Gly Val Met Asp Thr Thr Thr Ala Gln Gly Arg Ser Pro Val Glu
245 250 255

Val Ala Gln Asp Val Leu Ala Ala Val Gly Lys Lys Lys Lys Asp Val
260 265 270

Ile Ala Asp Leu Pro Ser Leu Ala Val Tyr Leu Arg Thr Leu
275 280 285

Ala Pro Gly Leu Phe Phe Ser Leu Met Pro Pro Gly Pro Glu Lys Ser
290 295 300

Gly Asn Pro Arg Thr Pro Ser Thr Leu Thr Ser Gln Gly Gln Gly Arg
305 310 315 320

Glu Ala Ala Leu Leu Gly Leu Leu Thr Leu Gln Gly Thr Val Ala Phe
325 330 335

Val Glu Thr Leu Met Glu Ile Cys Leu Thr Ser Gly Lys Asp
340 345 350

(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

Met Val Phe Leu Pro Arg Gly Val Val Val Ser Gly Gly Ala Ala Cys
1 5 10 15

Leu Trp Leu Thr Phe Ile Leu Glu Thr Glu Val Tyr Leu Asp Leu Ala
20 25 30

Thr Glu Ala Arg Ala His Ser Arg Met Gly Leu Gly Leu Trp Pro Pro
35 40 45

Asn



(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 278 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:

Met Ala Ser Ala Glu Leu Asp Tyr Thr Ile Glu Ile Pro Asp Gln Pro
1 5 10 15

Cys Trp Ser Gln Lys Asn Ser Pro Ser Pro Gly Gly Lys Glu Ala Glu
20 25 30

Thr Arg Gln Pro Val Val Ile Leu Leu Gly Trp Gly Gly Cys Lys Asp
35 40 45

Lys Asn Leu Ala Lys Tyr Ser Ala Ile Tyr His Lys Arg Gly Cys Ile
50 55 60

Val Ile Arg Tyr Thr Ala Pro Trp His Met Val Phe Phe Ser Glu Ser
65 70 75 80

Leu Gly Ile Pro Ser Leu Arg Val Leu Ala Gln Lys Leu Leu Glu Leu
85 90 95

Leu Phe Asp Tyr Glu Ile Glu Lys Glu Pro Leu Leu Phe His Val Phe
100 105 110

Ser Asn Gly Gly Val Met Leu Tyr Arg Tyr Val Leu Glu Leu Leu Gln
115 120 125

Thr Arg Arg Phe Cys Arg Leu Arg Val Val Gly Thr Ile Phe Asp Ser
130 135 140

Ala Pro Gly Asp Ser Asn Leu Val Gly Ala Leu Arg Ala Leu Ala Ala
145 150 155 160

Ile Leu Glu Arg Arg Ala Ala Met Leu Arg Leu Leu Leu Leu Val Ala
165 170 175

Phe Ala Leu Val Val Val Leu Phe His Val Leu Leu Ala Pro Ile Thr
180 185 190

Ala Xaa Phe His Thr His Phe Tyr Asp Arg Leu Gln Asp Ala Gly Ser
195 200 205

Arg Trp Pro Glu Leu Tyr Leu Tyr Ser Arg Ala Asp Glu Val Val Leu
210 215 220

Ala Arg Asp Ile Glu Arg Met Val Glu Ala Arg Leu Ala Arg Arg Val
225 230 235 240

Leu Ala Arg Ser Val Asp Phe Val Ser Ser Ala His Val Ser His Leu
245 250 255

Arg Asp Tyr Pro Thr Tyr Tyr Thr Ser Leu Cys Val Asp Phe Met Arg
260 265 270

Asn Cys Val Arg Cys Xaa
275

(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 199 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:

Met Ser Phe Ile Phe Asp Trp Ile Tyr Ser Gly Phe Ser Ser Val Leu
1 5 10 15

Gln Phe Leu Gly Leu Tyr Lys Lys Thr Gly Lys Leu Val Phe Leu Gly
20 25 30

Leu Asp Asn Ala Gly Lys Thr Thr Leu Leu His Met Leu Lys Asp Asp
35 40 45

Arg Leu Gly Gln His Val Pro Thr Leu His Pro Thr Ser Glu Glu Leu
50 55 60

Thr Ile Ala Gly Met Thr Phe Thr Thr Phe Asp Leu Gly Gly His Val
65 70 75 80

Gln Ala Arg Arg Val Trp Lys Asn Tyr Leu Pro Ala Ile Asn Gly Ile
85 90 95

Val Phe Leu Val Asp Cys Ala Asp His Glu Arg Leu Leu Glu Ser Lys
100 105 110

Glu Glu Leu Asp Ser Leu Met Thr Asp Glu Thr Ile Ala Asn Val Pro
115 120 125

Ile Leu Ile Leu Gly Asn Lys Ile Asp Arg Pro Glu Ala Ile Ser Glu
130 135 140

Glu Arg Leu Arg Glu Met Phe Gly Leu Tyr Gly Gln Thr Thr Gly Lys
145 150 155 160

Gly Ser Ile Ser Leu Lys Glu Leu Asn Ala Arg Pro Leu Glu Val Phe
165 170 175

Met Cys Ser Val Leu Lys Arg Gln Gly Tyr Gly Glu Gly Phe Arg Trp
180 185 190

Met Ala Gln Tyr Ile Asp Xaa
195

(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 258 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:

Met Thr Leu Ser Arg Phe Ala Tyr Asn Gly Lys Arg Cys Pro Ser Ser
1 5 10 15

Tyr Asn Ile Leu Asp Asn Ser Lys Ile Ile Ser Glu Glu Cys Arg Lys
20 25 30

Glu Leu Thr Ala Leu Leu His His Tyr Tyr Pro Ile Glu Ile Asp Pro
35 40 45

His Arg Thr Val Lys Glu Lys Leu Pro His Met Val Glu Trp Trp Thr
50 55 60

Lys Ala His Asn Leu Leu Cys Gln Gln Lys Ile Gln Lys Phe Gln Ile
65 70 75 80

Ala Gln Val Val Arg Glu Ser Asn Ala Met Leu Arg Glu Gly Tyr Lys
85 90 95

Thr Phe Phe Asn ito Leu Tyr His Asn Asn Ile Pro Leu Phe Ile Phe
100 105 110

Ser Ala Gly Ile Gly Asp Ile Leu Glu Glu Ile Ile Arg Gln Met Lys
115 120 125

Val Phe His Pro Asn Ile His Ile Val Ser Asn Tyr Met Asp Phe Asn
130 135 140

Glu Asp Gly Phe Leu Gln Gly Phe Lys Gly Gln Leu Ile His Thr Tyr
145 150 155 160

Asn Lys Asn Ser Ser Val Cys Glu Asn Xaa Gly Tyr Phe Gln Gln Leu
165 170 175

Glu Gly Lys Thr Asn Val Ile Leu Leu Gly Asp Ser Ile Gly Asp Leu
180 185 190

Thr Met Ala Asp Gly Val Pro Gly Val Gln Asn Ile Leu Lys Ile Gly
195 200 205

Phe Leu Asn Asp Lys Val Glu Glu Arg Arg Xaa Arg Tyr Met Asp Ser
210 215 220

Tyr Asp Ile Val Leu Glu Lys Asp Glu Thr Leu Asp Val Val Asn Gly
225 230 235 240

Leu Leu Gln His Ile Leu Cys Gln Gly Val Gln Leu Glu Met Gln Gly
245 250 255

Pro Xaa

(2) INFORMATION FOR SEQ ID NO: 450:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:

Met Ser His Val Leu Leu Cys Pro Ser Leu Ser Cys Ser Asn Leu Leu
1 5 10 15

Pro Pro Ser His Ser Leu Gly Thr Met Gly Ser Leu Ser Pro His Leu
20 25 30

Cys Gly His Thr Met Cys Pro Val Asn Pro Glu Leu Pro Leu Ser Ser
35 40 45

Arg Leu Thr Thr A^ Gln Pro Gln Pro Asp Ala Cys Ser Pro Thr Leu
50 55 60

Leu Thr Leu Pro Leu Pro Ser Ser Phe Leu Pro His Ser Lys Pro Thr
65 70 75 80

Phe Xaa His Pro Cys Ser Pro
85



(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 315 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:



Met Phe Ser Ile Asn Pro Leu Glu Asn Leu Lys Val Tyr Ile Ser Ser
1 5 10 15

Arg Pro Pro Leu Val Val Phe Met Ile Ser Val Xaa Pro Met Ala Ile
20 25 30

Ala Phe Leu Thr Leu Gly Tyr Phe Phe Lys Ile Lys Glu Ile Lys Ser
35 40 45

Pro Glu Met Ala Glu Asp Trp Asn Thr Phe Leu Leu Arg Phe Asn Asp
50 55 60

Leu Asp Leu Cys Val Ser Glu Asn Glu Thr Leu Lys His Leu Thr Asn
65 70 75 80

Asp Thr Thr Thr Pro Glu Ser Thr Met Thr Ser Gly Gln Ala Arg Ala
85 90 95

Ser Thr Gln Ser Pro Gln Ala Leu Glu Asp Ser Gly Pro Val Asn Ile
100 105 110

Ser Val Ser Ile Thr Leu Thr Leu Asp Pro Leu Lys Pro Phe Gly Gly
115 120 125

Tyr Ser Arg Asn Val Thr His Leu Tyr Ser Thr Ile Leu Gly His Gln
130 135 140

Ile Gly Leu Ser Gly Arg Glu Ala His Glu Glu Ile Asn Ile Thr Phe
145 150 155 160

Leu Pro Thr Ala Trp Ser Ser Asp Asp Cys Ala Leu His Gly His
165 170 175

Cys Glu Gln Val Val Phe Thr Ala Cys Met Thr Leu Thr Ala Ser Pro
180 185 190

Gly Val Phe Pro Val Thr Val Gln Pro Pro His Cys Val Pro Asp Thr
195 200 205

Tyr Ser Asn Ala Thr Leu Trp Tyr Lys Ile Phe Thr Thr Ala Arg Asp
210 215 220

Ala Asn Thr Lys Tyr Ala Gln Asp Tyr Asn Pro Phe Trp Cys Tyr Lys
225 230 235 240

Gly Ala Ile Gly Lys Val Tyr His Ala Leu Asn Pro Lys Leu Thr Val
245 250 255

Ile Val Pro Asp Asp Asp Arg Ser Leu Ile Asn Leu His Leu Met His
260 265 270

Thr Ser Tyr Phe Leu Phe Val Met Val Ile Thr Met Phe Cys Tyr Ala
275 280 285

Val Ile Lys Gly Arg Pro Ser Lys Leu Arg Gln Ser Asn Pro Glu Phe
290 295 300

Cys Pro Glu Lys Val Ala Leu Ala Glu Ala Xaa
305 310 315



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

Met Pro Gly Leu Ser Leu Ala Leu Leu Pro Phe Gly Pro Gly Cys Thr
1 5 10 15 

Glu Ala Leu His Ala Gly Cys Phe Pro Ala Phe Ala Ser Ala Thr Arg 
20 25 30 

Val Asn Gly Glu Ala Ala Leu Ser Pro Gly Leu Cys Asp Pro Ile Ser
35 40 45 

Val Pro Tyr Val 
50 



(2) INFORMATION FOR SEQ ID NO: 453:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

Met Ala Val Gly Gln Ile Met Thr Phe Gly Ser Pro Val Ile Gly Cys
1 5 10 15

Gly Phe Ile Ser Gly Trp Asn Leu Val Ser Met Cys Val Glu Tyr Val
20 25 30

Leu Leu Trp Lys Val Tyr Gln Lys Thr Pro Ala Leu Ala Val Lys Ala
35 40 45

Gly Leu Lys Glu Glu Glu Thr Glu Leu Lys Gln Leu Asn Leu His Lys
50 55 60

Asp Thr Glu Pro Lys Pro Leu Glu Gly Thr His Leu Met Gly Val Lys
65 70 75 80

Asp Ser Asn Ile His Glu Leu Glu His Glu Gln Glu Pro Thr Cys Ala
85 90 95

Ser Gln Met Ala Glu Pro Phe Arg Thr Phe Arg Asp Gly Trp Val Ser
100 105 110

Tyr Tyr Asn Gln Pro Val Phe Leu Ala Gly Met Gly Leu Ala Phe Leu
115 120 125

Tyr Met Thr Val Leu Gly Phe Asp Cys Ile Thr Thr Gly Tyr Ala Tyr
130 135 140

Thr Gln Gly Leu Ser Gly Phe His Pro Gln Tyr Phe Asp Gly Ser Ile
145 150 155 160

Ser Tyr Asn Trp Asn Asn Gly Asn Cys Ser Phe Tyr Leu Ala Thr Ser
165 170 175

Lys Met Trp Phe Gly Ser Ala Gly Leu Ile Ser Gly Leu Ala Gln Leu
180 185 190

Ser Cys Leu Ile Leu Cys Val Ile Ser Val Phe Met Pro Gly Ser Pro
195 200 205

Leu Asp Leu Ser Val Ser Pro Phe Glu Asp Ile Arg Ser Arg Phe Ile
210 215 220

Gln Gly Glu Ser Ile Thr Pro Thr Lys Ile Pro Glu Ile Thr Thr Glu
225 230 235 240

Ile Tyr Met Ser Asn Gly Ser Asn Ser Ala Asn Ile Val Pro Glu Thr
245 250 255

Ser Pro Glu Ser Val Pro Ile Ile Ser Val Ser Leu Leu Phe Ala Gly
260 265 270

Val Ile Ala Ala Arg Ile Gly Leu Trp Ser Phe Asp Leu Thr Val Thr
275 280 285

Gln Leu Leu Gln Glu Asn Val Ile Glu Ser Glu Arg Gly Ile Ile Asn
290 295 300

Gly Val Gln Asn Ser Met Asn Tyr Leu Leu Asp Leu Leu His Phe Ile
305 310 315 320

Met Val Ile Leu Ala Pro Asn Pro Glu Ala Phe Gly Leu Leu Val Leu
325 330 335

Ile Ser Val Ser Phe Val Ala Met Gly His Ile Met Tyr Phe Arg Phe
340 345 350

Ala Gln Asn Thr Leu Gly Asn Lys Leu Phe Ala Cys Gly Pro Asp Ala
355 360 365

Lys Glu Val Arg Lys Glu Asn Gln Ala Asn Thr Ser Val Val Xaa
370 375 380



(2) INFORMATION FOR SEQ ID NO: 454:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 186 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454:

Met Arg Ser Ile Gly Asn Lys Asn Thr Ile Leu Leu Gly Leu Gly Phe
1 5 10 15

Gln Ile Leu Gln Leu Ala Trp Tyr Gly Phe Gly Ser Glu Pro Trp Met
20 25 30

Met Trp Ala Ala Gly Ala Val Ala Ala Met Ser Ser Ile Thr Phe Pro
35 40 45

Ala Val Ser Ala Leu Val Ser Arg Thr Ala Asp Ala Asp Gln Gln Gly
50 55 60

Val Val Gln Gly Met Ile Thr Gly Ile Arg Gly Leu Cys Asn Gly Leu
65 70 75 80

Gly Pro Ala Leu Tyr Gly Phe Ile Phe Tyr Ile Phe His Val Glu Leu
85 90 95

Lys Glu Leu Pro Ile Thr Gly Thr Asp Leu Gly Thr Asn Thr Ser Pro
100 105 110

Gln His His Phe Glu Gln Asn Ser Ile Ile Pro Gly Pro Pro Phe Leu
115 120 125

Phe Gly Ala Cys Ser Val Leu Leu Ala Leu Leu Val Ala Leu Phe Ile
130 135 140

Pro Glu His Thr Asn Leu Ser Leu Arg Ser Ser Ser Trp Arg Lys His
145 150 155 160

Cys Gly Ser His Ser His Pro His Asn Thr Gln Ala Pro Gly Glu Ala
165 170 175

Lys Glu Pro Leu Leu Gln Asp Thr Asn Val
180 185



(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455:



Met Leu Gln Thr Ser Asn Tyr Ser Leu Val Leu Ser Leu Gln Phe Leu
1 5 10 15

Lai Leu Ser Tyr Asp Leu Phe Val Asn Ser Phe Ser Glu Leu Leu Gln
20 25 30

Lys Thr Pro Val Ile Gln Leu Val Leu Phe Ile Ile Gln Asp Ile Ala
35 40 45

Val Leu Phe Asn Ile Ile Ile Ile Phe Leu Met Phe Phe Asn Thr Phe
50 55 60

Val Phe Gln Ala Gly Leu Val Asn Leu Leu Phe His Lys Phe Lys Gly
65 70 75 80

Thr Ile Ile Leu Thr Ala Val Tyr Phe Ala Leu Ser Ile Ser Leu His
85 90 95

Val Trp Val Met Asn Leu Arg Trp Lys Asn Ser Asn Ser Phe Ile Trp
100 105 110

Thr Asp Gly Leu Gln Met Leu Phe Val Phe Gln Arg Leu Ala Ala Val
115 120 125

Leu Tyr Cys Tyr Phe Tyr Lys Arg Thr Ala Val Arg Leu Gly Asp Pro
130 135 140

His Phe Tyr Gln Asp Ser Leu Trp Leu Arg Lys Glu Phe Met Gln Val
145 150 155 160

Arg Arg Xaa



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:

Met Arg Ile Gln Val Phe Ile Leu Leu Leu Gly Ala Gly Gly Thr Ser
1 5 10 15

Gln Phe Thr Lys Pro Pro Ser Leu Pro Leu Glu Pro Glu Pro Ala Val
20 25 30

Glu Ser Ser Pro Thr Glu Thr Ser Glu Gln Ile Arg Glu Lys
35 40 45



(2) INFORMATION FOR SEQ ID NO: 457:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

Met Ser Tyr Leu Ala Phe Leu Tyr Met Thr Phe Asp Phe Cys Cys Leu
1 5 10 15

Tyr Phe Ser Thr Val Tyr Ala Pro Ser Phe Lys Tyr Ile Cys Val His
20 25 30

Thr Asp Thr His Ile Cys Val Cys Val Cys Ile Tyr Leu Ser Ser Val
35 40 45

Val Ser Lys Ser Ser Ala Glu Ala Asp Gly Val Leu Gln Pro Arg Arg
50 55 60

His Pro Ala Ser Leu Leu Ile Val Phe Ala Thr Ser Ile Ser Glu Ser
65 70 75 80

Ser Leu Leu Ile Phe Ser Phe Gln Lys Thr Glu Ala Lys Leu Ile Val
85 90 95

Phe Ala Val Ser Leu Ala Ala Lys Xaa
100 105



(2) INFORMATION FOR SEQ ID NO: 458:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser
1 5 10 15

Leu Leu Pro Val Ile Leu Ala Val Phe Pro Asp Ser Ala Gln Ile Val
20 25 30

Pro Leu Leu Lys Pro Ile Pro Arg Pro Gln Pro Glu Val Ile Phe Pro
35 40 45

Ser Ser Glu Leu Leu Glu Gln Leu Leu Ser Val Gln Phe Val Trp Gln
50 55 60

Ala His Thr Val Ala Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe
1 5 10 15

Ala Leu Val Gly Leu Ala Lys Leu Ser Glu Glu Ile Ser Ala Pro Val
20 25 30

Ser Glu Arg Met Asn Ala Leu Phe Val Gln Phe Ala Glu Val Phe Pro
35 40 45

Leu Lys Val Phe Gly Tyr Gln Pro Asp Pro Leu Asn Tyr Gln Ile Ala
50 55 60

Val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro
65 70 75 80

Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu Leu Met Met
85 90 95

Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys
100 105 110

Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly
115 120 125

Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys
130 135 140

Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:

Met Lys Leu Gly Arg Ala Val Leu Gly Leu Leu Leu Leu Ala Pro Ser
1 5 10 15

Val Val Gln Ala Val Glu Pro Ile Ser Leu Gly Leu Ala Leu Ala Gly
20 25 30

Val Leu Thr Gly Tyr Ile Tyr Pro Arg Leu Tyr Cys Leu Phe Ala Glu
35 40 45

Cys Cys Gly Gln Lys Arg Ser Leu Ser Arg Glu Ala Leu Gln Lys Asp
50 55 60

Leu Asp Asp Asn Leu Phe Gly Gln His Leu Ala Lys Lys Ile Ile Leu
65 70 75 80

Asn Ala Val Phe Gly Phe Ile Asn Asn Pro Lys Pro Lys Lys Pro Leu
85 90 95

Thr Leu Ser Leu His Gly Trp Thr Gly Thr Gly Lys Asn Phe Val Ser
100 105 110

Lys Ile Ile Ala Glu Asn Ile Tyr Glu Gly Gly Leu Asn Ser Asp Tyr
115 120 125

Val His Leu Phe Val Ala Thr Leu His Phe Pro His Ala Ser Asn Ile
130 135 140

Thr Leu Tyr Lys Asp Gln Leu Gln Leu Trp Ile Arg Gly Asn Val Ser
145 150 155 160

Ala Cys Ala Arg Ser Ile Phe Ile Phe Asp Glu Met Asp Lys Met His
165 170 175

Ala Gly Leu Ile Asp Ala Ile Lys Pro Phe Leu Asp Tyr Tyr Asp Leu
180 185 190

Val Asp Gly Val Ser Tyr Gln Lys Ala Met Phe Ile Phe Leu Ser Asn
195 200 205

Ala Gly Ala Glu Arg Ile Thr Asp Val Ala Leu Asp Phe Trp Arg Ser
210 215 220

Gly Lys Gln Arg Glu Asp Ile Lys Leu Lys Asp Ile Glu His Ala Leu
225 230 235 240

Ser Val Ser Val Phe Asn Asn Lys Asn Ser Gly Phe Trp His Ser Ser
245 250 255

Leu Ile Asp Arg Asn Leu Ile Asp Tyr Phe Val Pro Phe Leu Pro Leu
260 265 270

Glu Tyr Lys His Leu Lys Met Cys Ile Arg Val Glu Met Gln Ser Arg
275 280 285

Gly Tyr Glu Ile Asp Glu Asp Ile Val Ser Arg Val Ala Glu Glu Met
290 295 300

Thr Phe Phe Pro Lys Glu Glu Arg Val Phe Ser Asp Lys Gly Cys Lys
305 310 315 320

Thr Val Phe Thr Lys Leu Asp Tyr Tyr Tyr Asp Asp
325 330

(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

Met Leu Lys Cys Ile
1 5 



(2) INFORMATION FOR SEQ ID NO: 462:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:

Met Ile Leu Thr Leu Leu Ser Val Val Ser Thr Met Ala Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 463:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 285 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:

Met Lys Leu His Pro Pro Pro Pro Ser Pro Val Thr Gln Asp His Arg
1 5 10 15

Ser Lys Ser Ser His Ser Asn Trp Met Pro Arg Met Gly Ala Cys Ser
20 25 30

Met Ser Arg Thr Ser Ser Ser Gly Pro Pro Ser Leu Cys Lys Ser Thr
35 40 45

Ser Gly Arg Ser Cys 71»r Arg Pro His Cys Trp Pro Ser Leu Pro Ala
50 55 60

Trp Val Ser Val Phe Thr Arg Thr Asn Thr Gly Ser Trp Cys Tyr Pro
65 70 75 80

Ala 1^ Gly Gly Ala Phe Ser Arg Pro Trp Met Ser Ala Gln Ser Met
85 90 95

^ Cys Ala Gl" toS Ser Val Leu Gln Val Ala Cys Arg Leu Leu Asp
100 105 110

Ala Leu Glu Phe Leu His Glu Asn Glu Tyr Val His Gly Asn Val -Bur
115 120 125

Ala Glu Asn Ile Phe Val Asp Pro Glu Asp Gln Ser Gln Val Thr Leu
130 135 140

Ala Gly Tyr Gly Phe Ala Phe Arg Tyr Cys Pro Ser Gly Lys His Val
145 150 155 160

Ala Tyr Val Glu Gly Ser Arg Ser Pro His Glu Gly Asp Leu Glu Phe
165 170 175

Ile Ser Met Asp Leu His Lys Gly Cys Gly Pro Ser Arg Arg Xaa Asp
180 185 190

Leu Gln Ser Leu Gly Tyr Cys Met Leu Lys Trp Leu Tyr Gly Phe Leu
195 200 205

Pro Trp Thr Asn Cys Leu Pro Xaa Xaa Glu Asp Ile Met Lys Gln Lys
210 215 220

Gln Lys Phe Val Asp Lys Pro Gly Pro Phe Val Gly Pro Cys Gly His
225 230 235 240

Trp Ile Arg Pro Ser Glu Thr Leu Gln Lys Tyr Leu L^ Val Val Met
245 250 255

Ala Leu Thr Tyr Glu Glu Lys Pro Pro Tyr Ala Met Leu Arg Asn Asn
260 265 270

Leu Glu Ala Leu Leu Gln Asp L^ Arg Val Ser Pro Tyr
275 280 285

(2) INFORMATION FOR SEQ ID NO: 464:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 80 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:

Met Thr Ser Pro Pro Pro His Gln Gly Trp Glu Gln Arg Gly Cys Gly
1 5 10 15

Glu Ser Gln Val Pro Leu Ala Leu Ser Arg Val Phe Ser Thr Ser His
20 25 30

Tyr Cys Leu Leu Leu Val Ala Asn Gln Ser Ile Phe Phe Pro Cys Leu
35 40 45

Trp Ala Val Glu Arg Leu Leu Gly Val Arg Cys Thr Cys Pro Leu Ser
50 55 60

Trp Gly Lys Arg Ile Ile Ser Glu His Cys Ser Ala Gln Ser Ser Xaa
65 70 75 80

(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:

Met His Thr Trp Tyr Asn Asp Arg Arg Gln Asn Cys His Cys L^ Leu
1 5 10 15

Phe Phe Leu Ile Tyr Leu Arg Lys Ile Tyr Gln Val Val Pro His Val
20 25 30

Pro Leu Leu Val Lys Cys Arg Gly Arg Leu Lys Gly Val Asn Ile
35 40 45



(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:

Met Glu Leu Val Leu Val Phe Leu Cys Ser Leu Leu Ala Pro Met Val
1 5 10 15

Leu Ala Ser Ala Ala Glu Lys Glu Lys Glu Met Asp Pro Phe His Tyr
20 25 30

Asp Tyr Gln Thr Leu Arg Ile Gly Gly Leu Val Phe Ala Val Val Leu
35 40 45

Phe Ser Val Gly Ile Leu Leu Ile Leu Ser Arg Arg Cys Lys Cys Ser
50 55 60

Phe Asn Gln Lys Pro Arg Ala Pro Gly Asp Glu Glu Ala Gln Val Glu
65 70 75 80

Asn Leu Ile Thr Ala Asn Ala Thr Glu Pro Gln Lys Ala Glu Asn Xaa
85 90 95



(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

Met Ala Ser Gly Ala Asp Ser Lys Gly Asp Asp Leu Ser Thr Ala Ile
1 5 10 15

Leu Lys Gln Lys Asn Arg Pro Asn Arg Leu Ile Val Asp Glu Ala Ile
20 25 30

Asn Glu Asp Asn Ser Val Val Ser Leu Ser Gln Pro Lys Met Asp Glu
35 40 45

Leu Gln Leu Phe Arg Gly Asp Thr Val Leu Leu Lys Gly Lys Lys Arg
50 55 60

Arg Glu Ala Val Cys Ile Val Leu Ser Asp Asp Thr Cys Ser Asp Glu
65 70 75 80

Lys Ile Arg Met Asn Arg Val Val Arg Asn Asn Leu Arg Val Arg Leu
85 90 95

Gly Asp Val Ile Ser Ile Gln Pro Cys Pro Asp Val Lys Tyr Gly Lys
100 105 110

Arg Ile His Val Leu Pro Ile Asp Asp Thr Val Glu Gly Ile Thr Gly
115 120 125

Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr Phe Leu Glu Ala Tyr Arg
130 135 140

Pro Ile Arg Lys Gly Asp Ile Phe Leu Val Arg Gly Gly Met Arg Ala
145 150 155 160

Val Glu Phe Lys Val Val Glu Thr Asp Pro Ser Pro Tyr Cys Ile Val
165 170 175

Ala Pro Asp Thr Val Ile His Cys Glu Gly Glu Pro Ile Lys Arg Glu
180 185 190

Asp Glu Glu Glu Ser Leu Asn Glu Val Gly Tyr Asp Asp Ile Gly Gly
195 200 205

Cys Arg Lys Gln Leu Ala Gln Ile Lys Glu Met Val Glu Leu Pro Leu
210 215 220

Arg His Pro Ala Leu Phe Lys Ala Ile Gly Val Lys Pro Pro Arg Gly
225 230 235 240

Ile Leu Leu Tyr Gly Pro Pro Gly Thr Gly Lys Thr Leu Ile Ala Arg
245 250 255

Ala Val Ala Asn Glu Thr Gly Ala Phe Phe Phe Leu Ile Asn Gly Pro
260 265 270

Glu Ile Met Ser Lys Leu Ala Gly Glu Ser Glu Ser Asn Leu Arg Lys
275 280 285

Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro Ala Ile Ile Phe Ile Asp
290 295 300

Glu Leu Asp Ala Ile Ala Pro Lys Arg Glu Lys Thr His Gly Glu Val
305 310 315 320

Glu Arg Arg Ile Val Ser Gln Leu Leu Thr Leu Met Asp Gly Leu Lys
325 330 335

Gln Arg Ala His Val Ile Val Met Ala Ala Thr Asn Arg Pro Asn Ser
340 345 350

Ile Asp Pro Ala Leu Arg Arg Phe Gly Arg Phe Asp Arg Glu Val Asp
355 360 365

Ile Gly Ile Pro Asp Ala Thr Gly Arg Leu Glu Ile Leu Gln Ile His
370 375 380

Thr Lys Asn Met Lys Leu Ala Asp Asp Val Asp Leu Glu Gln Xaa
385 390 395



(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468:



Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:

Met Ala Ala Pro Lys Gly Ser Leu Trp Val Arg Thr Gln Leu Gly Leu
1 5 10 15

Pro Pro Leu Leu Leu Leu Thr Met Ala Leu Ala Gly Gly Ser Gly Thr
20 25 30

Ala Ser Ala Glu Ala Phe Asp Ser Val Leu Gly Asp Tto Ala Ser Cys
35 40 45

His Arg Ala Cys Gln Leu Thr Tyr Pro Leu His Thr Tyr Pro Lys Glu
50 55 60

Glu Glu Leu Tyr Ala Cys Gln Arg Gly Cys Arg Leu Phe Ser Ile Cys
65 70 75 80

Gln Phe Val Asp Asp Gly Ile Asp Leu Asn Arg Thr Lys Leu Glu Cys
85 90 95

Glu Ser Ala Cys Thr Glu Ala Tyr Ser Gln Ser Asp Glu Gln Tyr Ala
100 105 110

Cys His Leu Gly Cys Gln Asn Gln Leu Pro Phe Ala Glu Leu Arg Gln
115 120 125

Glu Gln Leu Met Ser Leu Met Pro Lys Met His Leu Leu Phe Pro Leu
130 135 140

Thr Leu Val Arg Ser Phe Trp Ser Asp Met Met Asp Ser Ala Gln Ser
145 150 155 160

Phe Ile Thr Ser Ser Trp Thr Phe Tyr Leu Gln Ala Asp Asp Gly Lys
165 170 175

Ile Val Ile Phe Xaa Ser Lys Pro Arg Asn Pro Arg Tyr Ala Pro His
180 185 190

Leu Glu Pro Gly Ala Leu Pro Asn Leu Xaa Xaa Xaa Ser Leu Ser Lys
195 200 205

Met Ser Xaa Xaa Ser Xaa Met Arg Asn Ser Gln Ala His Arg Asn Phe
210 215 220

Leu Glu Asp Gly Glu Ser Asp Gly Phe Leu Arg Cys Leu Ser Leu Asn
225 230 235 240

Ser Gly Trp Ile Leu Thr Thr Thr Leu Val Leu Ser Val Met Val Leu
245 250 255

^ Trp Ile Cys Cys Ala Thr Cys Cys Tyr Thr Leu L^ Asp Ala Val
260 265 270

Xaa



(2) INFORMATION FOR SEQ ID NO: 470:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 192 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:

Met Met Val Leu Ser Leu Gly Ile Ile Leu Ala Ser Ala Ser Phe Ser
1 5 10 15

Pro Asn Phe Thr Gln Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro
20 25 30

Phe Ile Gly Pro Phe Phe Phe Ile Ile Ser Gly Ser Leu Ser Ile Ala
35 40 45

Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly
50 55 60

Ser Ile Leu Ser Ala Leu Ser Ala Leu Val Gly Phe Ile Ile Leu Ser
65 70 75 80

Val Lys Gln Ala Thr Leu Asn Pro Ala Ser Leu Gln Cys Glu Leu Asp
85 90 95

Lys Asn Asn Ile Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His Asp
100 105 110

Ser Leu Tyr Thr Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly
115 120 125

Xaa Leu Ser Leu Met Leu Ile Cys Thr Leu Leu Glu Phe Cys Leu Ala
130 135 140

Val Leu Thr Ala Val Leu Arg Trp Lys Gln Ala Tyr Ser Asp Phe Pro
145 150 155 160

Gly Ser Val Leu Phe Leu Pro His Ser Tyr Ile Gly Asn Ser Gly Met
165 170 175

Ser Ser Lys Met Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser
180 185 190


(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

Met Arg Lys Thr Arg Leu Trp Gly Leu Leu Trp Met Leu Phe Val Ser
1 5 10 15

Glu Leu Arg Ala Ala Thr Lys Leu Thr Glu Glu Lys Tyr Glu Leu Lys
20 25 30

Glu Gly Gln Thr Leu Asp Val Lys Cys Asp Tyr Thr Leu Glu Lys Phe
35 40 45

Ala Ser Ser Gln Lys Ala Trp Gln Ile Ile Arg Asp Gly Glu Met Pro
50 55 60

Lys Thr Leu Ala Cys Thr Glu Arg Pro Ser Lys Asn Ser His Pro Val
65 70 75 80

Gln Val Gly Arg Ile Ile Leu Glu Asp Tyr His Asp His Gly Leu Leu
85 90 95

Arg Val Arg Met Val Asn Leu Gln Val Glu Asp Ser Gly Leu Tyr Gln
100 105 110

Cys Val Ile Tyr Gln Pro Pro Lys Glu Pro His Met Leu Phe Asp Arg
115 120 125

Ile Arg Leu Val Val Thr Lys Gly Phe Ser Gly Thr Pro Gly Ser Asn
130 135 140

Glu Asn Ser Thr Gln Asn Val Tyr Lys Ile Pro Pro Thr Thr Thr Lys
145 150 155 160

Ala Leu Cys Pro Leu Tyr Thr Ser Pro Arg Thr Val Thr Gln Ala Pro
165 170 175

Pro Lys Ser Thr Ala Asp Val Ser Thr Pro Asp Ser Glu Ile Asn Leu
180 185 190

Thr Asn Val Thr Asp Ile Ile Arg Val Pro Val Phe Asn Ile Val Ile
195 200 205

Leu Leu Ala Gly Gly Phe Leu Ser Lys Ser Leu Val Phe Ser Val Leu
210 215 220

Phe Ala Val Thr Leu Arg Ser Phe Val Pro
225 230

(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

Met Leu His Ile Leu Pro Leu Lys Ser Tyr Asp Phe Pro His Phe Ser
1 5 10 15

Leu Met Gly Arg Tyr Arg Cys Ala Ser Leu Leu Phe Cys Phe Leu Leu
20 25 30

Leu Phe Phe Phe Phe Cys Ser Val Leu Trp Thr Phe Ser Asp Met His
35 40 45

Arg Ser Gly Glu Asp Gly Pro Trp Thr Pro Cys Val His His Leu Ala
50 55 60

Ala Ser Leu Ile Ser Tyr Gly Gln Pro Gly Phe Ile Cys Ile Ser Leu
65 70 75 80

Phe Ser Pro Val Leu Phe Ile Glu Asn Pro Arg His Tyr Ala Asn Ala
85 90 95

Thr Val Thr Thr Leu Gly Asp Trp Xaa
100 105


(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473:

Met Val Phe Leu Lys Tyr Arg Phe Leu Phe Phe Leu Val Phe Leu Ala
1 5 10 15

Asn Cys Ile Tyr Ser Leu His Tyr Lys Pro Ser Leu Met Tyr Pro Lys
20 25 30


(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474:

Met Ala Leu Ser Arg Gly Leu Pro Arg Glu Leu Ala Glu Ala Val Ala
1 5 10 15

Gly Gly Arg Val Leu Val Val Gly Ala Gly Gly Ile Gly Cys Glu Leu
20 25 30

Leu Lys Asn Leu Val Leu Thr Gly Phe Ser His Ile Asp Leu Ile Asp
35 40 45

Leu Asp Thr Ile Asp Val Ser Asn Leu Asn Arg Gln Phe Leu Phe Gln
50 55 60

Lys Lys His Val Gly Arg Ser Lys Ala Gln Val Ala Lys Glu Ser Val
65 70 75 80

Leu Gln Phe Tyr Pro Lys Ala Asn Ile Val Ala Tyr His Asp Ser Ile
85 90 95

Met Asn Pro Asp Tyr Asn Val Glu Phe Phe Arg Gln Phe Ile Leu Val
100 105 110

Met Asn Ala Leu Asp Asn Arg Ala Ala Arg Asn His Val Asn Arg Met
115 120 125

Cys Leu Ala Ala Asp Val Pro Leu Ile Glu Ser Gly Thr Ala Gly Tyr
130 135 140

Leu Gly Gln Val Thr Thr Ile Lys Lys Gly Val Thr Glu Cys Tyr Glu
145 150 155 160

Cys His Pro Lys Pro Thr Gln Arg Thr Phe Pro Gly Cys Thr Ile Arg
165 170 175

Asn Thr Pro Ser Glu Pro Ile His Cys Ile Val Trp Ala Lys Tyr Leu
180 185 190

Phe Asn Gln Leu Phe Gly Glu Glu Asp Ala Asp Gln Glu Val Ser Pro
195 200 205

Asp Arg Ala Asp Pro Glu Ala Ala Trp Glu Pro Thr Glu Ala Glu Ala
210 215 220

Arg Ala Arg Ala Ser Asn Glu Asp Gly Asp Ile Lys Arg Ile Ser Thr
225 230 235 240

Lys Glu Trp Ala Lys Ser Thr Gly Tyr Asp Pro Val Lys Leu Mifi Thr
245 250 255

Lys Leu Phe Lys A^ Asp Ile Arg Tyr Leu Leu Thr Met Asp Lys Leu
260 265 270

Trp Arg Lys Arg Lys Pro Pro Val Pro Leu Asp Trp Ala Glu Val Gln
275 280 285

Ser Gln Gly Glu Glu Thr Asn Ala Ser Asp Gln Gln Asn Glu Pro Gln
290 295 300

Leu Gly Leu Lys Asp Gln Gln Val Leu Asp Val Lys Ser Tyr Ala Arg
305 310 315 320

Leu Phe Ser Lys Ser Ile Glu Thr Leu Arg Val His Leu Ala Glu Lys
325 330 335

Gly Asp Gly Ala Glu Leu Ile Trp Asp Lys Asp Asp Pro Ser Ala Met
340 345 350

Asp Phe Val Thr Ser Ala Ala Asn Leu Arg Met His Ile Phe Ser Met
355 360 365

Asn Met Lys Ser Arg Phe Asp Ile Lys Ser Met Ala Gly Asn Ile Ile
370 375 380

Pro Ala Ile Ala Thr Thr Asn Ala Val Ile Ala Gly Leu Ile Val Leu
385 390 395 400

Glu Gly Leu Lys Ile Leu Ser Gly Lys Ile Asp Gln Cys Arg Thr Ile
405 410 415

Phe Leu Asn Lys Glh Pro Asn Pro Arg Lys Lys Leu Leu Val Pro Cys
420 425 430

Ala Leu Asp Pro Pro Asn Pro Asn Cys Tyr Val Cys Ala Ser Lys Pro
435 440 445

Glu Val Thr Val Arg Leu Asn Val His Lys Val Thr Val Leu Thr Leu
450 455 460

Gln Asp Lys Ile Val Lys Glu Lys Phe Ala Met Val Ala Pro Asp Val
465 470 475 480

Gln Ile Glu Asp Gly Lys Gly Thr Ile Leu Ile Ser Ser Glu Glu Gly
485 490 495

Glu Thr Glu Ala Asn Asn His Lys Lys Leu Ser Glu Phe Gly Ile Arg
500 505 510

Asn Gly Ser Arg Leu Gln Ala Asp Asp Phe Leu Gln Asp Tyr Thr Leu
515 520 525

Leu Ile Asn Ile Leu His Ser Glu Asp Leu Gly Lys Asp Val Glu Phe
530 535 540

Glu Val Val Gly Asp Ala Pro Glu Lys Val Gly Xaa Lys Gln Ala Glu
545 550 555 560

Asp Ala Ala Lys Ser Ile Thr Asn Gly Gln Xaa
565 570



(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

Met Gln Val Val Thr Cys Leu Thr Arg Asp Ser Tyr Leu Thr His Cys
1 5 10 15

Phe Leu Gln His Leu Met Val Val Leu Ser Ser Leu Glu Arg Thr Pro
20 25 30

Ser Pro Glu Pro Val Asp Lys Asp Phe Tyr Ser Glu Phe Gly Asn Lys
35 40 45

Thr Thr Gly Lys Met Glu Asn Tyr Glu Leu Ile His Ser Ser Arg Val
50 55 60

Lys Phe Thr Tyr Pro Ser Glu Glu Glu Ile Gly Asp Leu Thr Phe Thr
65 70 75 80






Val Ala Gln Lys Met Ala Glu Pro Glu Lys Ala Pro Ala Leu Ser Ile
85 90 95

Leu Leu Tyr Val Gln Ala Phe Gln Val Gly Met Pro Pro Pro Gly Cys
100 105 110

Cys Arg Gly Pro Leu Arg Pro Lys Thr Leu Leu Leu Thr Ser Ser Glu
115 120 125

Ile Phe Leu Leu Asp Glu Asp Cys Val His Tyr Pro Leu Pro Glu Phe
130 135 140

Ala Lys Glu Pro Pro Gln Arg Asp Arg Tyr Arg Leu Asp Asp Gly Arg
145 150 155 160

Arg Val Arg Asp Leu Asp Arg Val Leu Met Gly Tyr Gln Thr Tyr Pro
165 170 175

Gln Pro Ser Pro Ser Ser Ser Met Thr Cys Lys Val Met Thr Ser Trp
180 185 190

Ala Val Ser Pro Trp Thr Thr Leu Gly Arg Cys Gln Val Ala Arg Leu
195 200 205

Glu Pro Ala Arg Ala Val Lys Ser Ser Gly Arg Cys Leu Ser Pro Val
210 215 220

Leu Arg Ala Glu Arg Ser Ser Ser Arg Cys Trp Leu Ala Ser Gly Arg
225 230 235 240

Pro Cys Val Ala Val Ser Cys Leu Ser Ser Ser Pro Ala Ser Pro Gly
245 250 255

His Ser Gln Pro Val Val Ser Ser Leu Thr Pro Thr Gly Ala Gly Gln
260 265 270

Gln Ala Phe Val Phe Ser Lys Asn Val Leu Ser Ser Leu Trp Tyr Leu
275 280 285

Asn Leu Thr Val Leu Ala Glu Asn Val Asn Met Cys Val Cys Cys Val
290 295 300

Asn Ser Phe Ser Cys Trp Glu Xaa
305 310



(2) INFORMATION FOR SEQ ID NO: 476: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

Met Ala Gln His His Leu Trp Ile Leu Leu Leu Cys Leu Gln Thr Trp
1 5 10 15

Pro Glu Ala Ala Gly Lys Asp Ser Glu Ile Phe Thr Val Asn Gly Ile
20 25 30

Leu Gly Glu Ser Val Thr Phe Pro Val Asn Ile Gln Glu Pro Arg Gln
35 40 45

Val Lys Ile Ile Ala Trp Thr Ser Lys Thr Ser Val Ala Tyr Val Thr
50 55 60

Pro Gly Asp Ser Glu Thr Ala Pro Val Val Thr Val Thr His Arg Asn
65 70 75 80

Tyr Tyr Glu Arg Ile His Ala Leu Gly Pro Asn Tyr Asn Leu Val Ile
85 90 95

Ser Asp Leu Arg Met Glu Asp Ala Gly Asp Tyr Lys Ala Asp Ile Asn
100 105 110

Thr Gln Ala Asp Pro Tyr Thr Thr Thr Lys Arg Tyr Asn Leu Gln Ile
115 120 125

Tyr Arg Arg Leu Gly Lys Pro Lys Ile Thr Gln Ser Leu Met Ala Ser
130 135 140

Val Asn Ser Thr Cys Asn Val Thr Leu Thr Cys Ser Val Glu Lys Glu
145 150 155 160

Glu Lys Asn Val Thr Tyr Asn Trp Ser Pro Leu Gly Glu Glu Gly Asn
165 170 175

Val Leu Gln Ile Phe Gln Thr Pro Glu Asp Gln Glu Leu Thr Tyr Thr
180 185 190

Cys Thr Ala Gln Asn Pro Val Ser Asn Asn Ser Asp Ser Ile Ser Ala
195 200 205

Arg Gln Leu Cys Ala Asp Ile Ala Met Gly Phe Arg Thr His His Thr
210 215 220

Gly Leu Leu Ser Val Leu Ala Met Phe Phe Leu Leu Val Leu Ile Leu
225 230 235 240

Ser Ser Val Phe Leu Phe Arg Leu Phe Lys Arg Arg Gln Asp Ala Ala
245 250 255

Ser Lys Lys Thr Ile Tyr Thr Tyr Ile Met Ala Ser Arg Asn Thr Gln
260 265 270

Pro Ala Glu Ser Arg Ile Tyr Asp Glu Ile Leu Gln Ser Lys Val Leu
275 280 285

Pro Ser Lys Glu Glu Pro Val Asn Thr Val Tyr Ser Glu Val Gln Phe
290 295 300

Ala Asp Lys Met Gly Lys Ala Ser Thr Gln Asp Ser Lys Pro Pro Gly
305 310 315 320

Thr Ser Ser Tyr Glu Ile Val Ile Xaa
325



(2) INFORMATION FOR SEQ ID NO: 477:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 178 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:



Met Lys Leu Gln Cys Val Ser Leu Trp Leu Leu Gly Thr Ile Leu Ile
1 5 10 15

Leu Cys Ser Val Asp Asn His Gly Leu Arg Arg Cys Leu Ile Ser Thr
20 25 30

Asp Met His His Ile Glu Glu Ser Phe Gln Glu Ile Lys Arg Ala Ile
35 40 45

Gln Ala Lys Asp Thr Phe Pro Asn Val Thr Ile Leu Ser Thr Leu Glu
50 55 60

1^ Leu Gln Ile Ile Lys Pro Leu Asp Val Cys Cys Val Thr Lys Asn
65 70 75 80

Leu Leu Ala Phe Tyr Val Asp Arg Val Phe Lys Asp His Gln Glu Pro
85 90 95

Asn Pro Lys Ile Leu Arg Lys Ile Ser Ser Ile Ala Asn Ser Phe Leu
100 105 110

Tyr Met Gln Lys Thr Leu Arg Gln Cys Gln Glu Gln Arg Gln Cys His
115 120 125

Cys Arg Gln Glu Ala Thr Asn Ala Thr Arg Val Ile His Asp Asn Tyr
130 135 140

Asp Gln Leu Glu Val His Ala Ala Ala Ile Lys Ser Leu Gly Glu Leu
145 150 155 160

Asp Val Phe Leu Ala Trp Ile Asn Lys Asn His Glu Val Met Ser Ser
165 170 175

Ala Xaa 



(2) INFORMATION FOR SEQ ID NO: 478:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478:

Asp Thr Ala Ile Arg Val Ala Leu Ala Val Ala Val Leu Lys Thr Val
1 5 10 15

Ile Leu Gly Leu Leu Cys Leu Leu Leu Cys Gly Gly Gly Glu Gly Lys
20 25 30

Val Ala Gly Arg Gln Ala Val Thr Ser Asp Gln Gln Ser Val Gly Arg
35 40 45

Arg Asp Val Tyr
50



(2) INFORMATION FOR SEQ ID NO: 479:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 62 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:

Met Gln Lys Lys Asn Ser Leu Phe Phe Phe Phe Ala Phe Tyr Tyr Glu
1 5 10 15

Asn Lys Thr Asn Ala Pro Gly Glu Gly Ser Met Ile Thr Arg Asn Ile
20 25 30

Lys Glu Tyr Phe Leu Pro Phe Leu Phe Cys Cys Val Glu Ala Ser Ile
35 40 45

Ala Ile Asn Lys Leu Asn Tyr Leu His Trp Thr His Phe Gln
50 55 60



(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

Met Pro Gly Leu Ser Leu Ile Leu Thr Val Thr Leu Leu Ala Val Ser
1 5 10 15

Asp Ser Ala Ala Thr Cys Ile Val Ala Lys Gly
20 25



(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:

Met Ser Gly Pro Asp Val Glu Thr Pro Ser Ala Ile Gln Ile Cys Arg
1 5 10 15

Ile Met Arg Pro Asp Asp Ala Asn Val Ala Gly Asn Val His Gly Gly
20 25 30

Thr Ile Leu Lys Met Ile Glu Glu Ala Gly Ala Ile Ile Ser Thr Arg
35 40 45

His Cys Asn Ser Gln Asn Gly Glu Arg Cys Val Ala Ala Leu Ala Arg
50 55 60

Val Glu Arg Thr Asp Phe Leu Ser Pro Met Cys Ile Gly Glu Val Ala
65 70 75 80

His Val Ser Ala Glu Ile Thr Tyr Thr Ser Lys His Ser Val Glu Val
85 90 95

Gln Val Asn Val Met Ser Glu Asn Ile Leu Thr Gly Ala Lys Lys Leu
100 105 110

Thr Asn Lys Ala Thr Leu Trp Tyr Val Pro Leu Ser Leu Lys Asn Val
115 120 125

Asp Lys Val Leu Glu Val Pro Pro Val Val Tyr Ser Arg Xaa Glu Gln
130 135 140

Glu Glu Glu Gly Arg Lys Arg Tyr Glu Ala Gln Lys Leu Glu Arg Met
145 150 155 160

Glu Thr Lys Trp Arg Asn Gly Asp Ile Val Gln Pro Val Leu Asn Pro
165 170 175

Glu Pro Asn Thr Val Ser Tyr Ser Gln Ser Ser Leu Ile His Leu Val
180 185 190

Gly Pro Ser Asp Cys Thr Leu His Gly Phe Val His Gly Gly Val Thr
195 200 205

Met Lys Leu Met Asp Glu Val Ala Gly Ile Val Ala Ala Arg His Cys
210 215 220

Lys Thr Asn Ile Val Thr Ala Ser Val Asp Ala Ile Asn Phe His Asp
225 230 235 240

Lys Ile Arg Lys Gly Cys Val Ile Thr Ile Ser Gly Arg Met Thr Phe
245 250 255

Thr Ser Asn Lys Ser Met Glu Ile Glu Val Leu Val Asp Ala Asp Pro
260 265 270

Val Val Asp Ser Ser Gln Lys Arg Tyr Arg Ala Ala Ser Ala Phe Phe
275 280 285

Thr Tyr Val Ser Leu Ser Gln Glu Gly Arg Ser Leu Pro Val Pro Gln
290 295 300

Leu Val Pro Glu Thr Glu Asp Glu Lys Lys Arg Phe Glu Glu Gly Lys
305 310 315 320

Gly Arg Tyr Leu Gln Met Lys Ala Lys Xaa Gln Gly His Ala Xaa Xaa
325 330 335

Gln Pro Xaa

(2) INFORMATION FOR SEQ ID NO: 482:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482:

Met Leu Asn Ser Asn Ile Asn Asp Leu Leu Met Val Thr Tyr Leu Ala
1 5 10 15

Asn Leu Thr Gln Ser Gln Ile Ala Leu Asn Glu Lys h&i Val Asn Leu
20 25 30




(2) INFORMATION FOR SEQ ID NO: 483:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:

Met Arg Glu Thr Ser Ile Arg Val Leu Leu Met Leu Pro Ala Leu Glu
1 5 10 15

Ser Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr Arg Ile Gly
20 25 30

Cys Phe Lys Thr Ile Thr Cys Trp Pro Thr Ser Leu Thr Gln Arg Xaa
35 40 45



(2) INFORMATION FOR SEQ ID NO: 484:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484:

Met Tyr Met Tyr Ser Leu Asn Val Phe Leu Ser Phe Ile Phe Leu Ala
1 5 10 15

Leu Val Phe Lys Cys Val His Val Cys Gln Gly Ala Asn Ala Phe Leu
20 25 30

Phe Leu Lys Leu Val Phe
35

(2) INFORMATION FOR SEQ ID NO: 485:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485:

Met Gly Leu Arg Leu Ile Cys Leu Glu Leu Thr Met Val Lys Ala Leu
1 5 10 15

Val Cys Glu Met Phe Leu Phe Phe Leu Met Thr Gln Lys Leu Ile Trp
20 25 30

Gln Glu Cys Thr Glu Lys Phe Ala Lys Leu Leu Val Gln Leu Ile Ser
35 40 45

Leu Val Phe Ala Trp Glu Phe Phe Ser Glu Asp Thr Pro
50 55 60


(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 346 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:

Met Leu Ala Ala Arg Leu Val Cys Leu Arg Thr Leu Pro Ser Arg Val
1 5 10 15

Phe His Pro Ala Phe Thr Lys Ala Ser Pro Val Val Lys Asn Ser Ile
20 25 30

Thr Lys Asn Gln Trp Leu Leu Thr Pro Ser Arg Glu Tyr Ala Thr Lys
35 40 45

Thr Arg Ile Gly Ile Arg Arg Gly Arg Thr Gly Gln Glu Leu Lys Glu
50 55 60

Ala Ala Leu Glu Pro Ser Met Glu Lys Ile Phe Lys Ile Asp Gln Met
65 70 75 80

Gly Arg Trp Phe Val Ala Gly Gly Ala Ala Val Gly Leu Gly Ala Leu
85 90 95

Cys Tyr Tyr Gly Leu Gly Leu Ser Asn Glu Ile Gly Ala Ile Glu Lys
100 105 110

Ala Val Ile Trp Pro Gln Tyr Val Lys Asp Arg Ile His Ser Thr Tyr
115 120 125

Met Tyr Leu Ala Gly Ser Ile Gly Leu Thr Ala Leu Ser Ala Ile Ala
130 135 140

Ile Ser Arg Thr Pro Val Leu Met Asn Phe Met Met Arg Gly Ser Trp
145 150 155 160

Val Thr Ile Gly Val Thr Phe Ala Ala Met Val Gly Ala Gly Met Leu
165 170 175

Val Arg Ser Ile Pro Tyr Asp Gln Ser Pro Gly Pro Lys His Leu Ala
180 185 190

Trp Leu Leu His Ser Gly Val Met Gly Ala Val Val Ala Pro Leu Thr
195 200 205

Ile Leu Gly Gly Pro Leu Leu Ile Arg Ala Ala Trp Tyr Thr Ala Gly
210 215 220

Ile Val Gly Gly Leu Ser Thr Val Ala Met Cys Ala Pro Ser Glu Lys
225 230 235 240

Phe Leu Asn Met Gly Ala Pro Leu Gly Val Gly Leu Gly Leu Val Phe
245 250 255

Val Ser Ser Leu Gly Ser Met Phe Leu Pro Pro Thr Thr Val Ala Gly
260 265 270

Ala Thr Leu Tyr Ser Val Ala Met Tyr Gly Gly Leu Val Leu Phe Ser
275 280 285

Met Phe Leu Leu Tyr Asp Thr Gln Lys Val Ile Lys Arg Ala Glu Val
290 295 300

Ser Pro Met Tyr Gly Val Gln Lys Tyr Asp Pro Ile Asn Ser Met Leu
305 310 315 320

Ser Ile Tyr Met Asp Thr Leu Asn Ile Phe Met Arg Val Ala Thr Met
325 330 335

Leu Ala Thr Gly Gly Asn Arg Lys Lys Xaa
340 345




(2) INFORMATION FOR SEQ ID NO: 487:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:

Met Glu Glu Val Leu Leu Leu Gly Leu Lys Asp Arg Glu Gly Tyr Thr
1 5 10 15

Ser Phe Trp Asn Asp Cys Ile Ser Ser Gly Leu Arg Gly Cys Met Leu
20 25 30

Ile Glu Leu Ala Leu Arg Gly Arg Leu Gln Leu Glu Ala Cys Gly Met
35 40 45

Arg Arg Lys Ser Leu Leu Thr Arg Lys Val Ile Cys Lys Ser Asp Ala
50 55 60

Pro Thr Gly Asp Val Leu Leu Asp Glu Ala Leu Lys His Val Lys Glu
65 70 75 80

Thr Gln Pro Pro Glu Thr Val Gln Asn Trp Ile Glu Leu Leu Ser Gly
85 90 95

Glu Thr Trp Asn Pro Leu Lys Leu His Tyr Gln Leu Arg Asn Val Arg
100 105 110

Glu Arg Leu Ala Lys Asn Leu Val Glu Lys Gly Val Leu Thr Thr Glu
115 120 125

Lys Gln Asn Phe Leu Leu Phe Asp Met Thr Thr His Pro Leu Thr Asn
130 135 140

Asn Asn Ile Lys Gln Arg Leu Ile Lys Lys Val Gln Glu Ala Val Leu
145 150 155 160

Asp Lys Trp Val Asn Asp Pro His Arg Met Asp Arg Arg Leu Leu Ala
165 170 175

Leu Ile Tyr Leu Ala His Ala Ser Asp Val Leu Glu Asn Ala Phe Ala
180 185 190

Pro Leu Leu Asp Glu Gln Tyr Asp Leu Ala Thr Lys Arg Val Arg Gln
195 200 205

Leu Leu Asp Leu Asp Pro Glu Val Glu Cys Leu Lys Ala Asn Thr Asn
210 215 220

Glu Val Leu Trp Ala Val Val Ala Ala Phe Thr Lys Xaa
225 230 235

(2) INFORMATION FOR SEQ ID NO: 488:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 200 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488:

Met Ala Gln Arg Met Val Trp Val Asp Leu Glu Met Thr Gly Leu Asp
1 5 10 15

Ile Glu Lys Asp Gln Ile Ile Glu Met Ala Cys Leu Ile Thr Asp Ser
20 25 30

Asp Leu Asn Ile Leu Ala Glu Gly Pro Asn Leu Ile Ile Lys Gln Pro
35 40 45

Asp Glu Leu Leu Asp Ser Met Ser Asp Trp Cys Lys Glu His His Gly
50 55 60

Lys Ser Gly Leu Thr Lys Ala Val Lys Glu Ser Thr Ile Thr Leu Gln
65 70 75 80

Gln Ala Glu Tyr Glu Phe Leu Ser Phe Val Arg Gln Gln Thr Pro Pro
85 90 95

Gly Leu Cys Pro Leu Ala Gly Asn Ser Val His Glu Asp Lys Lys Phe
100 105 110

Leu Asp Lys Tyr Met Pro Gln Phe Met Lys His Leu His Tyr Arg Ile
115 120 125

Ile Asp Val Ser Thr Val Lys Glu Leu Cys Arg Arg Trp Tyr Pro Glu
130 135 140

Glu Tyr Glu Phe Ala Pro Lys Lys Ala Ala Ser His Arg Ala Leu Asp
145 150 155 160

Asp Ile Ser Glu Ser Ile Lys Glu Leu Gln Phe Tyr Arg Asn Asn Ile
165 170 175

Phe Lys Lys Lys Ile Asp Glu Lys Lys Arg Lys Ile Ile Glu Asn Gly
180 185 190

Glu Asn Glu Lys Thr Val Ser Xaa
195 200


(2) INFORMATION FOR SEQ ID NO: 489:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 351 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:

Met Ala Thr Thr Ala Ala Pro Ala Gly Gly Ala Arg Asn Gly Ala Gly
1 5 10 15

Pro Glu Trp Gly Gly Phe Glu Glu Asn Ile Gln Gly Gly Gly Ser Ala
20 25 30

Val Ile Asp Met Glu Asn Met Asp Asp Thr Ser Gly Ser Ser Phe Glu
35 40 45

Asp Met Gly Glu Leu His Gln Arg Leu Arg Glu Glu Glu Val Asp Ala
50 55 60

Asp Ala Ala Asp Ala Ala Ala Ala Glu Glu Glu Asp Gly Glu Phe Leu
65 70 75 80

Gly Met Lys Gly Phe Lys Gly Gln Leu Ser Arg Gln Val Ala Asp Gln
85 90 95

Met Trp Gln Ala Gly Lys Arg Gln Ala Ser Arg Ala Phe Ser Leu Tyr
100 105 110

Ala Asn Ile Asp Ile Leu Arg Pro Tyr Phe Asp Val Glu Pro Ala Gln
115 120 125

Val Arg Thr Gly Leu Leu Glu Ser Met Ile Pro Ile Lys Met Val Asn
130 135 140

Phe Pro Gln Lys Ile Ala Gly Glu Leu Tyr Gly Pro Leu Met Leu Val
145 150 155 160

Phe Thr Leu Val Ala Ile Leu Leu His Gly Met Lys Thr Ser Asp Thr
165 170 175

Ile Ile Arg Glu Gly Thr Leu Met Gly Thr Ala Ile Gly Thr Cys Phe
180 185 190

Gly Tyr Trp Leu Gly Val Ser Ser Phe Ile Tyr Phe Leu Ala Tyr Leu
195 200 205

Cys Asn Ala Gln Ile Thr Met Leu Gln Met Leu Ala Leu Leu Gly Tyr
210 215 220

Gly Leu Phe Gly His Cys Ile Val Leu Phe Ile Thr Tyr Asn Ile His
225 230 235 240

Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val Gly Gly Leu Ser
245 250 255

Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr Val Gly Pro Thr
260 265 270

Gln Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu His Met Leu Phe
275 280 285

Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Val Glu Gly Ile Leu
290 295 300

Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro Ile Gln Arg Val Pro Arg
305 310 315 320

Asp Ile Pro Ala Met Leu Pro Ala Ala Arg Leu Pro Thr Thr Val Leu
325 330 335

Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gln Ser His Xaa
340 345 350



(2) INFORMATION FOR SEQ ID NO: 490:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:

Met Arg Gly Ser Arg Gly Gly Trp Ala Gly Glu Met Ala Ala Ser Gly
1 5 10 15

Glu Ser Gly Thr Ser Gly Gly Gly Gly Ser Thr Glu Glu Ala Phe Met
20 25 30

Thr Phe Tyr Ser Glu Val Lys Gln Ile Glu Lys Arg Asp Ser Val Leu
35 40 45

Thr Ser Lys Asn Gln Ile Glu Arg Leu Thr Arg Pro Gly Ser Ser Tyr
50 55 60

Phe Asn Leu Asn Pro Phe Glu Val Leu Gln Ile Asp Pro Glu Val Thr
65 70 75 80

Asp Glu Glu Ile Lys Lys Arg Phe Arg Gln Leu Ser Ile Leu Val His
85 90 95

Pro Asp Lys Asn Gln Asp Asp Ala Asp Arg Ala Gln Lys Ala Phe Glu
100 105 110

Ala Val Asp Lys Ala Tyr Lys Leu Leu Leu Asp Gln Glu Gln Lys Lys
115 120 125

Arg Ala Leu Asp Val Ile Gln Ala Gly Lys Glu Tyr Val Glu His Thr
130 135 140

Val Lys Glu Arg Lys Lys Gln Leu Lys Lys Glu Gly Lys Pro Thr Ile
145 150 155 160

Val Glu Glu Asp Asp Pro Glu Leu Phe Lys Gln Ala Val Tyr Lys Gln
165 170 175

Thr Met Lys Leu Phe Ala Glu Leu Glu Ile Lys Arg Lys Glu Arg Glu
180 185 190

Ala Lys Glu Met His Glu Arg Lys Arg Gln Arg Glu Glu Glu Ile Glu
195 200 205

Ala Gln Glu Lys Ala Lys Arg Glu Arg Glu Trp Gln Lys Asn Phe Glu
210 215 220

Glu Ser Arg Asp Gly Arg Val Asp Ser Trp Arg Asn Phe Gln Ala Asn
225 230 235 240

Thr Lys Gly Lys Lys Glu Lys Lys Asn Arg Thr Phe Leu Arg Pro Pro
245 250 255

Lys Val Lys Met Glu Gln Arg Glu Xaa
260 265



(2) INFORMATION FOR SEQ ID NO: 491:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491:

Asp Ser Met Pro Thr Cys Pro Leu Xaa Ala Ser Leu Glu Cys Gly Pro 
1 5 10 15 

Leu Leu Pro Val Arg Leu Cys Cys Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 492:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492:

Met Asn Glu Tyr Arg Val Pro Glu Leu Asn Val Gln Asn Gly Val Leu
1 5 10 15

Lys Ser Leu Ser Phe Leu Phe Glu Tyr Ile Gly Glu Met Gly Lys Asp
20 25 30

Tyr Ile Tyr Ala Val Thr Pro Leu Leu Glu Asp Ala Leu Met Arg
35 40 45

Asp Leu Val His Arg Gln Thr Ala Ser Ala Val Val Gln His Met Ser
50 55 60

Leu Gly Val Tyr Gly Phe Gly Cys Glu Asp Ser Leu Asn His Leu Leu
65 70 75 80

Asn Tyr Val Trp Pro Asn Val Phe Glu Thr Ser Pro His Val Ile Gln
85 90 95

Ala Val Met Gly Ala Leu Glu Gly Leu Arg Val Ala Ile Gly Pro Cys
100 105 110

Arg Met Leu Gln Tyr Cys Leu Gln Gly Leu Phe His Pro Ala Arg Lys
115 120 125

Val Arg Asp Val Tyr Trp Lys Ile Tyr Asn Ser Ile Tyr Ile Gly Ser
130 135 140

Gln Asp Ala Leu Ile Ala His Tyr Pro Arg Ile Tyr Gln Arg Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 493:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 279 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

Met Ile Ser Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met
1 5 10 15

Tyr Asp Gly Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln
20 25 30

Leu Glu A^ Ile Arg Asn Leu Gln Asp Leu Ufa: Pro Leu Lys Leu Ala
35 40 45

Ala Lys Glu Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu
50 55 60

Phe Ser Gly Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr
65 70 75 80

Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys
85 90 95

Glu Glu Asn Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro
100 105 110

His Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln
115 120 125

Ala Lys Trp Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys
130 135 140

Asn Leu Ile Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro
145 150 155 160

Thr Leu Lys Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn
165 170 175

Ser Met Leu Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr
180 185 190

Leu Leu Val Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile
195 200 205

Trp Ile Ser Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln
210 215 220

Ala Leu Leu Thr Val Val Ser Gln Val Leu Cys Phe Leu Xaa Ile Glu
225 230 235 240

Trp Tyr Leu Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn
245 250 255

Leu Leu Tyr Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val
260 265 270

Met Ile Gln Lys Pro Trp Xaa
275

(2) INFORMATION FOR SEQ ID NO: 494:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 193 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:



Met Ile Arg Cys Gly Leu Ala Cys Glu Arg Cys Arg Trp Ile Leu Pro
1 5 10 15

Leu Leu Leu Leu Ser Ala Ile Ala Phe Asp Ile Ile Ala Leu Ala Gly
20 25 30

Arg Gly Trp Leu Gln Ser Ser Asp His Gly Gln Thr Ser Ser Leu Trp
35 40 45

Trp Lys Cys Ser Gln Glu Gly Gly Gly Ser Gly Ser Tyr Glu Glu Gly
50 55 60

Cys Gln Ser Leu Met Glu Tyr Ala Trp Gly Arg Ala Ala Ala Ala Met
65 70 75 80

Leu Phe Cys Gly Phe Ile Ile Leu Val Ile Cys Phe Ile Leu Ser Phe
85 90 95

Phe Ala Leu Cys Gly Pro Gln Met Leu Val Phe Leu Arg Val Ile Gly
100 105 110

Gly Leu Leu Ala Leu Ala Ala Val Phe Gln Ile Ile Ser Leu Val Ile
115 120 125

Tyr Pro Val Lys Tyr Thr Gln Thr Phe Thr Leu His Ala Asn Xaa Ala
130 135 140

Val Thr Tyr Ile Tyr Asn Trp Ala Tyr Gly Phe Gly Trp Ala Ala Thr
145 150 155 160

Ile Ile Leu Ile Gly Cys Ala Phe Phe Phe Cys Cys Leu Pro Asn Tyr
165 170 175

Glu Asp Asp Leu Leu Gly Asn Ala Lys Pro Arg Tyr Phe Tyr Thr Ser
180 185 190

Ala 



(2) INFORMATION FOR SEQ ID NO: 495:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 205 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:

Met Ala Ala Gly A^ Gln Val Phe Ser Gly Ala Gly His Val Xaa Glu
1 5 10 15

His Val Ala Gly Gly Arg His Ala Trp Leu Leu Thr Trp Gln Ser Ala
20 25 30

Cys Pro Ala Asn Arg Leu Ser Leu Val Pro Leu Val Pro Ser Ala Ser
35 40 45

Met Thr Arg Met Arg Xaa Arg Thr Ala Ser Gly Ser Ser Val Ile
50 55 60

Leu Trp Met Ala Pro Ala Ala Ala Pro Thr Pro Ala Arg Ala Pro Glu
65 70 75 80

Ala Ala Pro Thr Pro Ala Arg Ala Pro Ala Ala Ala Arg Thr Pro Ala
85 90 95

Arg Gly Pro Thr Trp Thr Ser Pro Pro Thr Arg Val Leu Leu Gly Thr
100 105 110

Xaa Pro Gly Pro Ser Pro Trp Arg Ser Pro Ala Arg Arg Pro Ala Gln
115 120 125

Leu Pro Pro Pro Asp Ser Asp Leu Cys Ser Gly Pro Leu Leu Pro Gly
130 135 140

Pro Phe Ser Pro Pro Ala Cys His Thr Ala Pro Asn Ser Val Leu Ile
145 150 155 160

Gln Ser Leu Phe Cys Lys Ser Glu Leu Trp Trp Arg Gln Met Arg Ser
165 170 175

Ile Thr Trp Val Pro Ser Pro Lys Ala Gly Trp Arg Trp Thr Lys Gly
180 185 190

Arg Lys Gln Ala Ser Pro His Arg Ile Leu Phe His Xaa
195 200 205



(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:

Met Ala Leu Thr Leu Leu Pro Ser Val Ser Arg Leu Pro Gly Glu Arg
1 5 10 15

Met Ala Ala Ser Gly Leu Pro Tyr Val Leu His His Lys Ser Ser Leu
20 25 30

Met Lys Val Ile Phe Phe Pro Tyr Pro Val Leu Pro Leu Pro Ala Pro
35 40 45

Asn Gly Thr Trp Val Pro Arg Leu Val Leu Gly Leu Gly Ser Gly Asp
50 55 60

Gln Val His Tyr Leu Pro Ile Ser Ser Ser Ile Val Asn Tyr Gly Thr
65 70 75 80

Ser Val Ser Gly Lys Ser Trp Val Phe Leu Val Tyr Pro Leu His Pro
85 90 95

Thr Pro Thr Trp Ser Thr Arg Cys Phe Gln Val Trp Asp Leu Leu Ser
100 105 110

Val Glu Leu Pro Asp Lys Gly Glu Gly Asn Thr Arg Arg Ala Ser Gly
115 120 125

Val Pro Gly Leu Ser Gln Leu Pro Thr Ser His Lys Pro Ile Lys Gln
130 135 140

Glu Tyr Xaa
145

(2) INFORMATION FOR SEQ ID NO: 497:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val
1 5 10 15

Gly Pro Cys Trp Pro Ser Thr Gly Asn Cys Cys Leu Cys Glu Val Gly
20 25 30

Ala Ala Leu Pro Pro Arg Gly Pro Ser Leu Ser Asp Cys Leu Gly Leu
35 40 45

Pro Pro Trp Thr Pro Trp Gly Pro Ala Trp Thr Leu Ala Gln Ser Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 498:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 94 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

Met Ser Thr Gly Ala Leu Asn Thr Ser Pro Pro Ala Ser Asn Arg Leu
1 5 10 15

Glu Ser Thr Leu Asn Glu Tyr Leu Ile Gln Pro Gln Leu His Cys Ser
20 25 30

Ser Val Gln Arg Leu Thr Leu Lys Trp Gly Cys Ser Ser Leu Gln Arg
35 40 45

Asp Gly Gln Ala Val Pro Trp Gly Leu Trp Gln Arg Ala Tyr Pro Ser
50 55 60

Leu Leu Pro Thr Leu Pro Ser Asp Leu Leu Arg Pro His Ala Val Thr
65 70 75 80

Pro Ser Val Ser Val Ser Val His Thr Cys Glu Ser Ser Xaa
85 90



(2) INFORMATION FOR SEQ ID NO: 499:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

Met Phe Leu Ile Phe Val Tyr Phe Leu Lys Xaa Leu Phe Ser Ser Ser
1 5 10 15

Leu Pro Phe Leu Trp Leu
20



(2) INFORMATION FOR SEQ ID NO: 500:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500:

Arg Gly Gly Leu Cys Pro Leu Leu Val Pro Gly Pro Leu Ala Arg Gln
1 5 10 15

Glu Pro Ser Pro Ser Leu Gln Gly Cys Ser Glu Ser Pro Val Gly Met
20 25 30

Asp



(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501:

Met Gln Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu
1 5 10 15

Cys Asp Val Pro Ile Ser Leu Gly Phe Ser Pro Ser
20 25



(2) INFORMATION FOR SEQ ID NO: 502:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502:

Pro Gly Lys Pro Gln Ala Cys Pro Glu Leu Tte Ser Val Leu Pro
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 503:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:

Asn Lys Ser Leu Xaa Ser Cys Leu Phe Val Leu His Phe Val Leu His
1 5 10 15

Cys Xaa Phe



(2) INFORMATION FOR SEQ ID NO: 504:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:

Met Glu Lys Thr His Arg Leu Arg Ile Arg Asn Pro Cys Leu Gln Phe
1 5 10 15

Ser Ile Leu Asn Leu Phe Leu Leu Lys Met Ile Val Ser
20 25



(2) INFORMATION FOR SEQ ID NO: 505:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505:

Met Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln
1 5 10 15

Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu
20 25 30

Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly
35 40 45

Pro Ser Ser Phe Gln Asn Pro Ala Ser Ser Pro Ser Ser Trp Thr His
50 55 60

Glu Glu Glu Pro Gly Tyr Phe Pro Gln Tyr Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506:

Leu Pro Leu Ala Glu Leu Lys Asn Trp Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 507:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 207 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507:

Met Leu Trp Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu
1 5 10 15

Ser Asp Thr Leu Trp Leu Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr
20 25 30

Gly Asp Val Val Pro Gly Thr Met Trp Gly Lys Ile Val Cys Leu Cys
35 40 45

Thr Gly Val Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val
50 55 60

Ala Arg Lys Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe
65 70 75 80

Met Met Asp Ile Gln Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg
85 90 95

Val Leu Gln Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu
100 105 110

Ser His Ala Ala Arg Arg His Gln Arg Xaa Leu Leu Ala Ala Ile Asn
115 120 125

Ala Phe Arg Gln Val Arg Leu Lys His Arg Lys Leu Arg Glu Gln Val
130 135 140

Asn Ser Met Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu
145 150 155 160

Gln Gln Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp
165 170 175

Thr Leu Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala
180 185 190

Leu Gly Pro Arg Gln Leu Pro Glu Pro Ser Gln Gln Ser Lys Xaa
195 200 205



(2) INFORMATION FOR SEQ ID NO: 508:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val
1 5 10 15

Ile Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala
20 25 30

Val Xaa Lys Lys
35



(2) INFORMATION FOR SEQ ID NO: 509:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:

Met Ala Leu Val Ala Leu Phe Thr Gln Leu Met Arg Xaa Leu Gly Arg
1 5 10 15

Cys Pro Gln



(2) INFORMATION FOR SEQ ID NO: 510:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510:

Met Thr Phe Pro Phe Glu Lys Glu Asn Ser Cys Mae Gln Cys Leu Leu
1 5 10 15

Phe Asp Ser Trp Arg Glu Gln Thr Arg Thr Asn Ile Gln Pro Gln Arg
20 25 30



(2) INFORMATION FOR SEQ ID NO: 511:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511:

Met His Leu Leu Asp Phe Phe Arg Asp Leu Val Leu Leu Val Leu Leu
1 5 10 15

Ala Leu Leu Asp Ser Phe Trp Leu Glu Val Gln Lys
20 25



(2) INFORMATION FOR SEQ ID NO: 512:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512:

Met Cys Leu Ile His Phe Ile Lys Ile Ile Leu Val Phe Ile Leu Lys
1 5 10 15

Leu Trp Leu Tyr Ser Gln Lys Cys Pro Lys
20 25



(2) INFORMATION FOR SEQ ID NO: 513:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513:

Met Ile His Val His Glu Trp Asn Asp Gln Met Leu Met Val Tyr Ile
1 5 10 15

Phe Leu Tyr Pro Val Ser Ile Thr Phe Leu Asn Leu Cys Ser Leu Thr
20 25 30

Cys



(2) INFORMATION FOR SEQ ID NO: 514:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:

Leu Asn Glu Ser Tyr Val Ser Arg Ala Gly Gly Trp Phe Ser Met Phe
1 5 10 15

Xaa Leu Ile Phe Phe Leu Leu Ala Leu Gly Ser Xaa Leu Cys Leu Leu
20 25 30

Leu Cys Leu Pro Ser Phe Asn Lys Thr Arg Arg Lys Gln Lys Pro
35 40 45



(2) INFORMATION FOR SEQ ID NO: 515:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515:

Ser Ser Lys Thr Pro Leu Pro Ser Glu Arg Arg Trp Ile Ser Gly Ser
1 5 10 15

Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly Ile Ala Leu Leu Gly
20 25 30

Leu Trp Ala Leu Glu Pro Ala Leu Gly His Trp
35 40



(2) INFORMATION FOR SEQ ID NO: 516:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516:

Leu Asn Trp
1



(2) INFORMATION FOR SEQ ID NO: 517:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 174 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517:

Phe Ala Phe Cys Ala Glu Leu Met Ile Gln Asn Trp Thr Leu Gly Ala
1 5 10 15

Val Asp Ser Gln Met Asp Asp Met Asp Met Asp Leu Asp Lys Glu Phe
20 25 30

Leu Gln Asp Leu Lys Glu Leu Lys Val Leu Val Ala Asp Lys Asp Leu
35 40 45

Leu Asp Leu His Lys Ser Leu Val Cys Thr Ala Leu Arg Gly Lys Leu
50 55 60

Gly Val Phe Ser Glu Met Glu Ala Asn Phe Lys Asn Leu Ser Arg Gly
65 70 75 80

Leu Val Asn Val Ala Ala Lys Leu Thr His Asn Lys Asp Val Arg Asp
85 90 95

Leu Phe Val Asp Leu Val Glu Lys Phe Val Glu Pro Cys Arg Ser Asp
100 105 110

His Trp Pro Leu Ser Asp Val Arg Phe Phe Leu Asn Gln Tyr Ser Ala
115 120 125

Ser Val His Ser Leu Asp Gly Phe Arg His Gln Ala Ser Gly Thr Ala
130 135 140

Thr Trp Ala Pro Ser Ala Ala Ala Ser Cys Ala Cys Ile Met Thr Glu
145 150 155 160

Val Pro Pro Asn Ala Pro Pro Thr Leu Thr Ile Lys Leu Leu
165 170



(2) INFORMATION FOR SEQ ID NO: 518:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518:

Met Trp Lys Asn Leu Gly Ser Gly Ser Val Phe Val Thr Trp Phe Ser
1 5 10 15

Leu Val Met Ile Leu Ser Gly Ile Gly Pro Leu Gly Asp Ala Glu Asp
20 25 30

Ser Ile Ser A^ Val Ser His Arg Leu Arg Pro
35 40



(2) INFORMATION FOR SEQ ID NO: 519:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:

Phe Gln Phe Pro Leu Leu Thr Ile Ala Leu Gln Phe Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:

Met His Tyr Val Ile Val Leu Ser Leu Phe Val Val Leu Glu Lys Lys
1 5 10 15

Asn Lys Met Gly Ser Asp Gly Cys Leu Arg Lys Asn Gly Ser
20 25 30



(2) INFORMATION FOR SEQ ID NO: 521:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:

Met Ser Arg Ser Ile Val Leu Arg Gly Ser Leu Phe Leu Phe Phe Ser
1 5 10 15

His Tyr Thr Leu Lys Leu Leu Ser Val Ile Lys Gln Thr Asn Arg Lys
20 25 30

Ile Val Trp Glu Lys Pro Cys Ile Arg Leu Phe Tyr Xaa Val Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 522:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522:

Met Pro Leu Pro Val Leu Leu Cys Leu Thr Leu Pro Met Pro Leu Pro
1 5 10 15

Ser Ala Thr Ala Arg Gly Gly Asn Arg Thr
20 25



(2) INFORMATION FOR SEQ ID NO: 523:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523:

Ser Ser Ile Pro Val Ser Ile Leu Ile Gly Met Lys Leu Ile Leu Tyr
1 5 10 15

Leu Leu Ile Thr Glu Ser Gly Ser His Glu Lys Lys Ser Phe Tyr Pro
20 25 30

Ser Phe Lys Tyr Met Phe Lys Ile Ile Ile Tyr Val Ser Ala Tyr Cys
35 40 45

Arg Thr Ala Leu Arg Ala Thr Val Ser His
50 55



(2) INFORMATION FOR SEQ ID NO: 524:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524:

Asn Arg Thr Leu Leu Phe Leu Ile Leu Phe Val Leu Phe Gly Leu Gly
1 5 10 15

Tyr Gly Phe



(2) INFORMATION FOR SEQ ID NO: 525:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525:

Met Phe Leu Leu Val Leu Ser Val Phe Cys Asp Phe Met Cys Ser Ile
1 5 10 15

Ala Pro Arg Cys His Ala Leu Ser Leu Val Ser Leu Arg Ala Gln His
20 25 30

Leu Ser Leu Phe Ile Thr Cys His
35 40



(2) INFORMATION FOR SEQ ID NO: 526:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 57 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:

Met Leu Leu Phe Ile Leu Leu Thr Leu Ser Ser Gly Cys Arg Leu Leu
1 5 10 15

Val Ser Ser Trp Lys Thr Phe Leu Pro His Phe Ser Leu Pro Gly Pro
20 25 30

Arg Glu His Pro Glu Gly Ser Arg Thr Trp Phe Phe Arg Tyr Trp Glu
35 40 45

Pro Gly Ala His Cys Leu His Cys Ala
50 55



(2) INFORMATION FOR SEQ ID NO: 527:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

Ala Arg Leu Leu Leu Phe Leu Ser Ser Val His Pro Ser Ile Met Pro
1 5 10 15

Ser Cys Asn Gln Leu
20



(2) INFORMATION FOR SEQ ID NO: 528:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

Met Ser Leu Thr Ser Ser Leu Thr Phe Leu Ser His Ile Leu Leu Leu
1 5 10 15

Pro Gln Lys Leu Gln Phe Leu Ser Trp Met Glu Arg Gln Gln Arg Cys
20 25 30

Thr Gly Val Ala Lys Tyr Ala
35



(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 128 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

Met Val Leu Arg Leu Ile Gln Leu Ile Phe Leu Ile Phe Phe Ile His
1 5 10 15

Ile Ile Ile Leu Leu Ile Pro Gly Ser Arg Pro Cys Gly Ser Trp Val
20 25 30

Asn Asp Arg Xaa Leu Gly Leu Arg Asp Val Thr His Leu Ile Tyr Leu
35 40 45

His Trp Val His Gly His Leu Pro Trp Cys His Pro Tyr Ile Gln Val
50 55 60

Glu Phe Ser Ala Leu Ile Glu Ser Thr Ala Gln Leu Gly Leu Pro Phe
65 70 75 80

Ser Trp Val Arg Val Ile His Pro Phe Leu Val Leu Pro Cys Leu Tyr
85 90 95

Ser Pro Gly Leu Lys Asn Gly Ile Phe Leu Phe Leu Leu Arg Ala Met
100 105 110

Pro Gly Gly Met Phe Pro Gly Asn Leu Glu Ala Phe Arg Val Pro Val
115 120 125



(2) INFORMATION FOR SEQ ID NO: 530:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:

Met Gly Ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser
1 5 10 15

Leu Gly Gly Arg Cys Ile Gln Gly Arg Phe Ala Ser His Ser Lys Phe
20 25 30

Trp Gly Phe Gly Xaa Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr
35 40 45

Pro Pro Asp Gln Glu Pro Gln Lys Glu Thr Glu Pro Ala Thr Ser Ser
50 55 60

His Ala Arg Pro Trp Ala Arg Val Ile Gly Leu Arg Ile Trp Pro Gln
65 70 75 80

Pro Asn



(2) INFORMATION FOR SEQ ID NO: 531:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531:

Met Leu Leu Ser Val Ala Ile Phe Ile Leu Leu Thr Leu Val Tyr Ala
1 5 10 15

Tyr Trp Thr Met
20



(2) INFORMATION FOR SEQ ID NO: 532:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532:

Asn Cys Glu Ile Leu Glu Tyr Cys Tyr Tyr Leu Thr Gln Lys Ile
1 5 10 15

Ser Met Gly Lys Tyr Leu Ser Ile Pro Thr Val Leu Leu Lys Xle Ile
20 25 30

Arg Cys Ser Ile Thr Ala Val Ser Asp Ser Ser Thr Ser Trp Ala Ile
35 40 45

Lys Ala Gln Leu Lys Ile Glu Asn Lys Asp Leu Asp Asn Lys Thr Ala
50 55 60

Lys Gly Gly Gly Gln Glu Ala Leu Thr Cys Thr
65 70 75



(2) INFORMATION FOR SEQ ID NO: 533:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533:

Met Phe Leu Met Arg Met His Leu Cys Phe Cys Lys Tyr Cys Cys Ser
1 5 10 15

Phe Ile Val Thr Pro Thr Ser Thr Ser Asn Thr Xaa Ser Tyr Leu Trp
20 25 30

Pro Trp Ile Ser Ala Ser Met Ala Gly Arg Gly Ser Xaa Trp Ala Cys
35 40 45

Tto Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu
50 55 60



(2) INFORMATION FOR SEQ ID NO: 534:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534:

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe
1 5 10 15

Thr Leu Ser Ile Asn Gln Glu Lys Leu Ala Asn His Leu Ala Phe Arg
20 25 30

Ile Leu Phe Phe Ile Val Phe
35



(2) INFORMATION FOR SEQ ID NO: 535:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535:

Met Leu
1



(2) INFORMATION FOR SEQ ID NO: 536:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:

Met Asp Gln Phe Lys Ile Phe Tyr Phe Leu Lys Ala Phe Phe Ala Cys
1 5 10 15

Cys Asn Val Gln Asp Pro Ser Pro Phe Met Gly Glu Thr Gly Ser Tyr
20 25 30

Leu Asn Ile Gly
35



(2) INFORMATION FOR SEQ ID NO: 537:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537:

Met Phe Asp Phe Leu Ser Tyr Phe Lys Asp Leu Leu Ser Cys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 538:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:

Met Gly Phe Gly Phe Val Leu Asn Ile Phe Ser Phe Phe Leu Xaa Pro
1 5 10 15

Pro Leu



(2) INFORMATION FOR SEQ ID NO: 539:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539:

Leu Leu Leu Trp Thr Leu Leu Ala Xaa Tyr Xaa
1 5 10



(2) INFORMATION FOR SEQ ID NO: 540:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:

Met Ala Ala Gln Lys Asp Gln Gln Lys Asp Ala Glu Ala Glu Gly Leu
1 5 10 15

Ser Gly Thr Thr Leu Leu Pro Lys Leu Ile Pro Ser Gly Ala Gly Arg
20 25 30

Glu Trp Leu Glu Arg Arg Arg Ala Thr Ile Arg Pro Trp Ser Thr Phe
35 40 45

Val Asp Gln Gln Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys
50 55 60

Gln Arg Leu Val Arg Asn Val Glu Tyr Tyr Gln Ser Asn Tyr Val Phe
65 70 75 80

Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val Thr Ser Pro Met Leu
85 90 95

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Xaa
100 105



(2) INFORMATION FOR SEQ ID NO: 541:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:

Phe Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val Thr Ser Pro Met
1 5 10 15

Leu Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr Ile Leu Tyr
20 25 30

Leu Arg Thr Leu Glu Ser Lys Leu Val Leu Gly Arg Glu Val Ser
35 40 45

Pro Ala His Gln Tyr Ala Leu Ala Gly Gly Ile Ser Phe Pro Phe Phe
50 55 60

Trp Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr
65 70 75 80

Leu Val Val Ile Gly Ser His Ala Ala Phe His Gln Ile Glu Ala Val
85 90 95

Asp Gly Glu Glu Leu Gln Met Glu Pro Val
100 105



(2) INFORMATION FOR SEQ ID NO: 542:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 136 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542:

Met Asp Arg Phe Thr Val Ala Gly Val Leu Pro Asp Ile Glu Gln Phe
1 5 10 15

Phe Asn Ile Gly Asp Ser Ser Ser Gly Leu Ile Gln Thr Val Phe Ile
20 25 30

Ser Ser Tyr Met Val Leu Ala Pro Val Phe Gly Tyr Leu Gly Asp Arg
35 40 45

Tyr Asn Arg Lys Tyr Leu Met Cys Gly Gly Ile Ala Phe Trp Ser Leu
50 55 60

Val Thr Leu Gly Ser Ser Phe Ile Pro Gly Glu His Phe Trp Leu Leu
65 70 75 80

Leu Leu Thr Arg Gly Leu Val Gly Val Gly Glu Ala Ser Tyr Ser Thr
85 90 95

Ile Ala Pro Thr Leu Ile Ala Asp Leu Phe Val Ala Asp Gln Arg Thr
100 105 110

Gly Cys Ser Ala Ser Ser Thr Leu Pro Phe Arg Trp Ala Val Val Trp
115 120 125

Ala Thr Leu Gln Ala Pro Lys Xaa
130 135



(2) INFORMATION FOR SEQ ID NO: 543:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 424 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543:

Met Ala Gly Asp Trp His Trp Ala Leu Arg Val Thr Pro Gly Leu Gly
1 5 10 15

Val Val Ala Val Leu Leu Leu Phe Leu Val Val Arg Glu Pro Pro Arg
20 25 30

Gly Ala Val Glu Arg His Ser Asp Leu Pro Pro Leu Asn Pro Thr Ser
35 40 45

Trp Trp Ala Asp Leu Arg Ala Leu Ala Arg Asn Pro Ser Phe Val Leu
50 55 60

Ser Ser Leu Gly Phe Thr Ala Val Ala Phe Val Thr Gly Ser Leu Ala
65 70 75 80

Leu Trp Ala Pro Ala Phe Leu Leu Arg Ser Arg Val Val Leu Gly Glu
85 90 95

Thr Pro Pro Cys Leu Pro Gly Asp Ser Cys Ser Ser Ser Asp Ser Leu
100 105 110

Ile Phe Gly Leu Ile Thr Cys Leu Thr Gly Val Leu Gly Val Gly Leu
115 120 125

Gly Val Glu Ile Ser Arg Arg Xaa Arg His Ser Asn Pro Arg Ala Asp
130 135 140

Pro Leu Val Cys Ala Thr Gly Leu Leu Gly Ser Ala Pro Phe Leu Phe
145 150 155 160

Leu Ser Leu Ala Cys Ala Arg Gly Ser Ile Val Ala Thr Tyr Ile Phe
165 170 175

Ile Phe Ile Gly Glu Thr Leu Leu Ser Met Asn Trp Ala Ile Val Ala
180 185 190

Asp Ile Leu Leu Tyr Val Val Ile Pro Thr Arg Arg Ser Thr Ala Glu
195 200 205

Ala Phe Gln Ile Val Leu Ser His Leu Leu Gly Asp Ala Gly Ser Pro
210 215 220

Tyr Leu Ile Gly Leu Ile Ser Asp Arg Leu Arg Arg Asn Trp Pro Pro
225 230 235 240

Ser Phe Leu Ser Glu Phe Arg Ala Leu Gln Phe Ser Leu Met Leu Cys
245 250 255

Ala Phe Val Gly Ala Leu Gly Gly Ala Leu Ser Trp Ala Pro Xaa Ser
260 265 270

Ser Arg Pro Thr Ala Gly Gly His Ser Cys Thr Cys Arg Ala Cys
275 280 285

Cys Thr Lys Gln Gly Pro Gln Thr Thr Gly Leu Trp Cys Pro Ser Gly
290 295 300

Ala Ala Pro Pro Ala Cys Pro Trp Pro Val Cys Ser Ser Glu Arg Leu
305 310 315 320

Pro Leu Thr Tyr Leu His Ile Cys His Ser Xaa Pro Trp Ala His Pro
325 330 335

Thr Lys Gly Leu Gly Leu Thr Pro Trp Pro Gly Pro Ala Ser Arg Gly
340 345 350

Thr Leu Gly Arg Val Pro Ala Pro Arg His Tyr Xaa Gly Ser Ser Gly
355 360 365

Glu Glu Val Gly Val Gln Glu Gly Asp Pro Ser Pro Gln Gly Xaa Pro
370 375 380

Gln Gly Leu Gly Ala Ile Cys Asn Gly Ile Lys Phe Val Ala Arg Pro
385 390 395 400

Gln Val Pro Ala Leu Val Phe Leu Trp Val Ala Ser Asp Leu Ala Pro
405 410 415

Arg Leu His Pro Arg Ala Pro Glu
420



(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

Met Phe Arg Phe Val Ile Cys Leu Phe Leu Trp Leu Val Leu Cys Arg
1 5 10 15

A^ Ser Thr Ser Ala Ser Arg Ile Ala Leu Tyr Tyr Arg Ile Val ^e
20 25 30

Leu Ile His Gln Cys Ser Ser
35



(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545:

Met Leu Pro Trp Xaa Ala Gln Leu Leu Asp Arg Thr Ile Gly Pro Leu
1 5 10 15

Tyr Leu Leu Phe Val Gln Phe Ser Pro Ala Phe Ser Arg Thr Ser Pro
20 25 30

Trp Arg Ser Pro Lys Asn Phe Arg Arg Leu Tyr Pro Pro Cys Thr Thr
35 40 45

Ser Gly Cys Ala Ala Arg Trp Leu Phe Ser
50 55



(2) INFORMATION FOR SEQ ID NO: 546:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:

Met Gly Leu Ser Val Leu Leu Pro Leu Cys Leu Leu Gly Pro Gly Arg
1 5 10 15

Phe Thr Ser Gly Gln Lys Pro Leu Asp Thr Pro Gly Leu Gly Val Pro
20 25 30



(2) INFORMATION FOR SEQ ID NO: 547:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 367 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:

Met Ala Lys Pro Gln Val Val Val Ala Pro Val Leu Met Ser Lys Leu
1 5 10 15

Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr
20 25 30

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr Pro Thr Leu Ser Glu
35 40 45

Tyr Val Gln Asp Phe Leu Asn His Leu Thr Glu Gln Pro Gly Ser Phe
50 55 60

Glu Thr Glu Ile Glu Gln Phe Ala Glu Thr Leu Asn Gly Cys Val Thr
65 70 75 80

Thr Asp Asp Ala Leu Gln Glu Leu Val Glu Leu Ile Tyr Gln Gln Ala
85 90 95

Thr Ser Ile Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr
100 105 110

Ser His His Leu Thr Ile Ser Pro Gln Ser Gly Asn Phe Arg Gln
115 120 125

Leu Leu Leu Gln Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gln Ala
130 135 140

Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu
145 150 155 160

Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu Ile Lys Gly Thr Asn Gly
165 170 175

Gln Val Thr Arg Ala Asp Ile Leu Gln Val Gly Leu Arg Glu Leu Leu
180 185 190

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu Ile Cys Ala Val
195 200 205

Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu
210 215 220

Lys Gly Lys Met Asp Met Glu Glu Ile Ile Gln Arg Ile Glu Asn Val
225 230 235 240

Val Leu Asp Ala Asn Cys Ser Arg Asp Val Lys Gln Met Leu Leu Lys
245 250 255

Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser
260 265 270

Thr Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn
275 280 285

Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp
290 295 300

Pro Asp Tyr Gln Glu Lys Tyr Gln Glu Leu Leu Glu Arg Glu Asp Phe
305 310 315 320

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp
325 330 335

Pro Tyr Leu Asp Asp Ile Asp Asp Glu Met Asp Pro Glu Ile Glu Glu
340 345 350

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gln
355 360 365



(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

Met Leu Arg Leu Asp Ile Ile Asn Ser Leu Val Thr Thr Val Phe Met
1 5 10 15

Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu Thr Thr Thr Leu Thr
20 25 30

Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala
35 40 45

Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro
50 55 60

Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu
65 70 75



(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:

Met Leu Lys Gln Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser
1 5 10 15

His Cys Trp Gly Leu Pro Leu Ala Cys Gly Thr Phe Val Gln Gly His
20 25 30

Gln Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gln Gly Ala
35 40 45



(2) INFORMATION FOR SEQ ID NO: 550:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 168 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550:

Met Leu Leu Ser Leu Ala Ala Phe Ser Val Ile Ser Val Val Ser Tyr
1 5 10 15

Leu Ile Leu Ala Leu Leu Ser Val Thr Ile Ser Phe Arg Ile Tyr Lys
20 25 30

Ser Val Ile Gln Ala Val Gln Lys Ser Glu Glu Gly His Pro Phe Lys
35 40 45

Ala Tyr Leu Asp Val Asp Ile Thr Leu Ser Ser Glu Ala Phe His Asn
50 55 60

Tyr Met Asn Ala Ala Met Val His Ile Asn Arg Ala Leu Lys Leu Ile
65 70 75 80

Ile Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala
85 90 95

Val Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly Ile
100 105 110

Thr Leu Leu Ile Leu Ala Glu Leu Leu Ile Phe Ser Val Pro Ile Val
115 120 125

Tyr Glu Lys Tyr Lys Thr Gln Ile Asp His Tyr Val Gly Ile Ala Arg
130 135 140

Asp Gln Thr Lys Ser Ile Val Glu Lys Ile Gln Ala Lys Leu Pro Gly
145 150 155 160

Ile Ala Lys Lys Lys Ala Glu Xaa
165



(2) INFORMATION FOR SEQ ID NO: 551:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 124 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551:

Ser Val Pro Phe His Leu Leu Val Val Leu Arg Ser Arg Ala Val Arg
1 5 10 15

Ala Arg Arg Arg Arg Glu Pro Arg Ser Leu Pro Arg Pro Gly Asp Glu
20 25 30

Glu Leu Gln Leu Leu Cys Gly Ala Arg Ser Asp Phe Leu Glu Arg
35 40 45

Cys Glu Glu Asp Trp Val Cys Leu Trp His His Ala Asp His Ala Ala
50 55 60

Phe Pro Gly Ser Phe Gln Cys His Gln Cys Gly Phe Leu Pro His Pro
65 70 75 80

Gly Ser Ser Leu Cys His His Gln Leu Gln Asp Leu Gln Val Arg His
85 90 95

Pro Ser Cys Tte Glu Val Arg Arg Arg Pro Ser Ile Gln Ser Leu Pro
100 105 110

Gly Arg Arg His Tyr Ser Val Leu Arg Ser Phe Pro
115 120



(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 177 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552:

Met Val His Leu Leu Val Ser Gly Ala Trp Gly Met Gln Met Trp
1 5 10 15

Val Thr Phe Val Ser Gly Phe L^ Phe Arg Ser Leu Pro Arg His
20 25 30

Thr Phe Gly Leu Val Gln Ser Lys Leu Phe Pro Phe Tyr Phe His Ile
35 40 45

Ser Met Gly Cys Ala Phe Ile Asn Leu Cys Ile Leu Ala Ser Gln His
50 55 60

Ala Trp Ala Gln Leu Thr Phe Trp Glu Ala Ser Gln Leu Tyr Leu Leu
65 70 75 80

Phe Leu Ser Leu Thr Leu Ala Thr Val Asn Ala Arg Trp Leu Glu Pro
85 90 95

Arg Thr Thr Ala Ala Met Trp Ala Leu Gln Thr Val Glu Lys Glu Arg
100 105 110

Gly Leu Gly Gly Glu Val Pro Gly Ser His Gln Gly Pro Asp Pro Tyr
115 120 125

Arg Gln Leu Arg Glu Lys Asp Pro Lys Tyr Ser Ala Leu Arg Gln Asn
130 135 140

Phe Phe Arg Tyr His Gly Leu Ser Ser Leu Cys Asn Leu Gly Cys Val
145 150 155 160

Leu Ser Asn Gly Leu Cys Leu Ala Gly Leu Ala Leu Glu Ile Arg Ser
165 170 175

Leu



(2) INFORMATION FOR SEQ ID NO: 553:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553:

Met Ala Phe Ile Leu Leu Phe Tyr Cys Leu Met Thr Phe Leu Ser Leu
1 5 10 15

Glu Gln Asn Ser Ala Thr Val Glu Pro Ser Ser His Glu Ile Leu His
20 25 30

Leu Leu Gln Asn Cys Phe Glu Leu Leu Arg Thr Ser Thr Ser Gln Cys
35 40 45

Thr Glu Gly Ile Pro Cys Ala Lys Ile Pro Glu Trp Val Thr His Leu
50 55 60

Thr Trp Gln Thr Leu Lys Asn Ser
65 70



(2) INFORMATION FOR SEQ ID NO: 554:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554:

Val Leu Arg Ile Ile Cys Leu Trp Pro Cys Gly Thr Thr Leu Pro Leu
1 5 10 15

Val Glu Lys Ala His Asp Ser His Ser Ala Asp Pro Val Cys Pro Gly
20 25 30

Leu Thr Ala His Leu Pro Val Leu Leu Tyr Val Gln Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 555:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 251 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555:

Met Lys His Ala Asp Pro Arg Ile Gln Gly Tyr Pro Leu Met Gly Ser
1 5 10 15

Pro Leu Leu Met Thr Ser Ile Leu Leu Thr Tyr Val Tyr Phe Val Leu
20 25 30

Ser Leu Gly Pro Arg Ile Met Ala Asn Arg Lys Pro Phe Gln Leu Arg
35 40 45

Gly Phe Met Ile Val Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr
50 55 60

Ile Val Tyr Glu Phe Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp
65 70 75 80

Arg Cys Asp Pro Gln Asp Cys Thr Leu Gly Gln Cys Pro Ser Val Pro
85 90 95

Ser Pro Xaa Thr Pro Val Thr Lys Ala Tyr Val Val Arg Thr Glu Gln
100 105 110

Gly Thr Gly Pro Pro Leu Pro Thr Ala Ala Leu Gln Gly Pro Arg Leu
115 120 125

Trp Phe Leu Oto His Phe Pro Arg Ala Ala Pro Gly Met Trp Pro His
130 135 140

^ ^ ^ Leu Gln Ser Trp Gly Leu Lys Gly Leu Tyr Ser Tyr
145 150 155 160

Phe Pro Leu Pro Ala Leu Lys Leu Gly Arg Gly Ala Leu Arg Ala Gly
165 170 175

Pro Thr Lys Gly Leu Val Ala Phe Phe Leu Thr Gln Lys Arg Ser Ala
180 185 190

Ile Met Ser Leu Trp Ite Gln Ser His Ser Ser Thr Pro His Thr Glu
195 200 205



Ala Val Ala Ser Gly Pro Lys Val Arg Val Gly Gly Gly Leu Gly Ile
210 215 220

Gln Pro Val Glu Ala Ala Tyr Ser Thr Cys Val Leu Ile Lys Ser Asp
225 230 235 240

Arg Gly Asn His Glu Lys Lys Lys Lys Lys Lys
245 250



(2) INFORMATION FOR SEQ ID NO: 556:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556:

Gly Leu Ala Gly Leu Cys Gly Gln Leu Ser Ser Pro Ala Leu Cys Val
1 5 10 15

Asn Arg Leu 



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 217 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557:

Met Ile Thr Glu Lys Trp Gly Leu Asn Met Glu Tyr Cys Arg Gly Gln
1 5 10 15

Ala Tyr Ile Xaa Ser Ser Gly Phe Ser Ser Lys Met Lys Val Val Ala
20 25 30

Ser Arg Leu Leu Glu Lys Tyr Pro Gln Ala Ile Tyr Thr Leu Cys Ser
35 40 45

Ser Cys Ala Leu Asn Met Trp Leu Ala Lys Ser Val Pro Val Met Gly
50 55 60

Val Ser Val Ala Leu Gly Thr Ile Glu Glu Val Cys Ser Phe Phe His
65 70 75 80

Arg Ser Pro Gln Leu Leu Leu Glu Leu Asp Asn Val Ile Ser Val Leu
85 90 95

Phe Gln Asn Ser Lys Glu Arg Gly Lys Glu Leu Lys Glu Ile Cys His
100 105 110

Ser Gln Trp Thr Gly Arg His Asp Ala Phe Glu Ile Leu Val Glu Leu
115 120 125

Leu Gln Ala Leu Val Leu Cys Leu Asp Gly Ile Asn Ser Asp Thr Asn
130 135 140

Ile Arg Trp Asn Asn Tyr Ile Ala Gly Arg Ala Phe Val Leu Cys Ser
145 150 155 160

Ala Val Ser Asp Phe Asp Phe Ile Val Thr Ile Val Val Leu Lys Asn
165 170 175

Val Leu Ser Phe Thr Arg Ala Phe Gly Lys Asn Leu Gln Gly Gln Thr
180 185 190

Ser Asp Val Phe Phe Ala Ala Gly Ser Leu Thr Ala Val Leu His Ser
195 200 205

Leu Asn Glu Val Ile Gly Lys Tyr Xaa
210 215



(2) INFORMATION FOR SEQ ID NO: 558:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558:

Leu Leu Lys Val Leu Cys Ile Leu Pro Val Met Lys Val Glu Asn Glu
1 5 10 15

Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg Asn Thr
20 25 30

Leu Thr Asp Gln Arg Ser Ser Asn Leu Ala Leu Leu Asn Ile Asn Phe
35 40 45

A^ Ile Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr Ile Lys Leu
50 55 60

Tyr Thr Ser Lys Ser Glu Leu Pro Thr Asp Asn Ser Glu Thr Val Glu
65 70 75 80

Asn Thr 



(2) INFORMATION FOR SEQ ID NO: 559:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 95 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559:

Met Val Leu Ile Leu Leu Asn Leu Leu Leu Gly Gln Phe Ser Cys Met
1 5 10 15

Ser Pro Ala Ser His His Cys His Pro Leu Pro Thr Glu Met Pro Cys
20 25 30

Ser Ser Asp Trp Gly Phe Asp Ser His Thr Val Tyr Pro Ser Cys Val
35 40 45

Asp Ala Leu Leu Pro Lys Pro Ser Ala Asn Ser Phe Pro Asn Gly Ser
50 55 60

Cys His Cys Gln Gly Leu Tyr Asn Gln Gln Gln Gln Asn Leu His Ala
65 70 75 80

Ala Glu Gly Pro Ala Ser Leu Arg Cys Asn Lys Tyr Val Ser Thr
85 90 95

(2) INFORMATION FOR SEQ ID NO: 560:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 54 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:

Met Ile Pro Ala Tyr Ser Lys Asn Arg Ala Tyr Ala Ile Phe Phe Ile
1 5 10 15

Val Phe Thr Val Ile Gly Asp Ala Pro Gly Ala Val Leu Ser Cys Ala
20 25 30

Gly His Pro Cys Val Gly Phe Ala Ala Val Leu Val Ala Pro Leu Thr
35 40 45

Val Ala Val Ser Ser Xaa
50



(2) INFORMATION FOR SEQ ID NO: 561:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:

Met Glu Val Pro Pro Pro Ala Pro Arg Ser Phe Leu Cys Arg Ala Leu
1 5 10 15

Cys Leu Phe Pro Arg Val Phe Ala Ala Glu Ala Val Thr Ala Asp Ser
20 25 30

Glu Val Leu Glu Glu Arg Gln Lys Arg Leu Pro Tyr Val Pro Glu Pro
35 40 45

Tyr Tyr Pro Glu Ser Gly Trp Asp Arg Leu Arg Glu Leu Phe Gly Lys
50 55 60

Asp Thr Val Asn Thr Ser Leu Asn Val Tyr Arg Asn Lys Asp Ala Leu
65 70 75 80

Ser His Phe Val Ile Ala Gly Ala Val Thr Gly Ser Leu Phe Arg Ile
85 90 95

Asn Val Gly Leu Arg Gly Trp Trp Leu Val Ala Xaa
100 105



(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562:

Met Asn Trp Gly Leu Ser Ile Trp Leu His Tyr Tyr Glu Lys Lys Lys
1 5 10 15

Glu Gln Val Phe Leu Val Ile Leu Ala His Val Val Arg Arg Cys Ala
20 25 30

Ser Asp Gly Ile Leu Gln Phe Glu Ser Ser Leu Leu Lys Met Arg Arg
35 40 45

Ala Pro 
50 



(2) INFORMATION FOR SEQ ID NO: 563:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563:

Met Val Lys Val Cys Asn Asp Ser Asp Arg Trp Ser Leu Ile Ser Leu
1 5 10 15

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu
20 25 30

Arg Arg Gln Phe Glu Phe Ser Val Asp Ser Phe Gln Ile Lys Leu Asp
35 40 45

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Thr
50 55 60

Phe His Pro Thr Ile Ile Gly Glu Ser Val Tyr Gly Asp Phe Gln Glu
65 70 75 80

Ala Phe Asp His Leu Cys Asn Lys Ile Ile Ala Thr Arg Asn Pro Glu
85 90 95

Glu Ile Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg
100 105 110

Gly Phe Arg Pro Ala Ser Asp Glu Ile Lys Thr Leu Gln Arg Tyr Met
115 120 125

Cys Ser Arg Phe Phe Ile Asp Phe Ser Asp Ile Gly Glu Gln Gln Arg
130 135 140

Lys Leu Glu Ser Tyr Leu Gln Asn His Phe Val Gly Leu Glu Asp Arg
145 150 155 160

Lys Tyr Glu Tyr Leu Met Thr Leu His Gly Val Val Asn Glu Ser Thr
165 170 175

Val Cys Leu Met Gly His Glu Arg Arg Gln Thr Leu Asn Leu Ile Thr
180 185 190

Met Leu Ala Ile Arg Val Leu Ala Asp Gln Asn Val Ile Pro Asn Val
195 200 205

Ala Asn Val Thr Cys Tyr Tyr Gln Pro Ala Pro Tyr Val Ala Asp Ala
210 215 220

Asn Phe Ser Asn Tyr Tyr Ile Ala Gln Val Gln Pro Val Phe Thr Cys
225 230 235 240

Gln Gln Gln Thr Tyr Ser Thr Trp Leu Pro Cys Asn Xaa
245 250



(2) INFORMATION FOR SEQ ID NO: 564:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564:

Met Ser Phe Leu Met Trp Leu Met Ser Leu Ala Ile Thr Ser Gln Pro
1 5 10 15 

Pro Met 



(2) INFORMATION FOR SEQ ID NO: 565:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 80 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565:

Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gln Ile Phe
1 5 10 15

Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg Ile Ile Leu Gly
20 25 30

Ala Asn Ala Ile Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser
35 40 45

Ala Ser Phe Trp Ala Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr
50 55 60

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Phe
65 70 75 80



(2) INFORMATION FOR SEQ ID NO: 566:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 73 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566:

His Leu Lys Asp Val Ile Leu Leu Thr Ala Ile Val Gln Val Leu Ser
1 5 10 15

Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu Ala Pro Gly Arg
20 25 30

Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro Trp Phe Thr Ala
35 40 45

A^ Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys Arg Gln Arg Arg
50 55 60

Gln Glu Arg Arg Gln Met Lys Arg Leu
65 70



(2) INFORMATION FOR SEQ ID NO: 567:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 263 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567:

Met Asp Cys Pro Ala Leu Pro Pro Gly Trp Lys Lys Glu Glu Val Ile
1 5 10 15

Arg Lys Ser Gly Leu Ser Ala Gly Lys Ser Asp Val Tyr Tyr Phe Ser
20 25 30

Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gln Leu Ala Arg Tyr Leu
35 40 45

Gly Asn Thr Val Asp Leu Ser Ser Phe Asp Phe Arg Thr Gly Lys Met
50 55 60

Met Pro Ser Lys Leu Gln Lys Asn Lys Gln Arg Leu Arg Asn Asp Pro
65 70 75 80

Leu Asn Gln Asn Lys Gly Lys Pro Asp Leu Asn Thr Thr Leu Pro Ile
85 90 95

Arg Gln Thr Ala Ser Ile Phe Lys Gln Pro Val Thr Lys Val Thr Asn
100 105 110

His Pro Ser Asn Lys Val Lys Ser Asp Pro Gln Arg Met Asn Glu Gln
115 120 125

Pro Arg Gln Leu Phe Trp Glu Lys Arg Leu Gln Gly Leu Ser Ala Ser
130 135 140

Asp Val Thr Glu Gln Ile Ile Lys Thr Met Glu Leu Pro Lys Gly Leu
145 150 155 160

Gln Gly Val Gly Pro Gly Ser Asn Asp Glu Thr Leu Leu Ser Ala Val
165 170 175

Ala Ser Ala Leu His Thr Ser Ser Ala Pro Ile Thr Gly Gln Val Ser
180 185 190

Ala Ala Val Glu Lys Asn Pro Ala Val Trp Leu Asn Thr Ser Gln Pro
195 200 205

Leu Cys Lys Ala Phe Ile Val Thr Asp Glu Asp Ile Arg Lys Gln Glu
210 215 220

Glu Arg Val Gln Gln Val Arg Lys Lys Leu Glu Glu Ala Leu Met Ala
225 230 235 240

Asp Ile Leu Ser Arg Ala Ala Asp Thr Glu Glu Met Asp Ile Glu Met
245 250 255



Asp Ser Gly Asp Glu Ala Xaa 
260 



(2) INFORMATION FOR SEQ ID NO: 568:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568:

Met Met Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gln Ala
1 5 10 15

Leu Arg Gln Ser Gln Gly Lys Ser Pro Leu Leu Trp Lys Arg Thr Leu
20 25 30

Leu Phe Gly Leu Thr His Leu Asn Pro Ser Ala Lys Leu Leu Leu Ser
35 40 45

Gln Met Lys Thr Ser Gly Asn Arg Lys Ser Glu Tyr Ser Lys Tyr Ala
50 55 60



Arg Asn Trp Lys Lys His 
65 70



(2) INFORMATION FOR SEQ ID NO: 569:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:

Met Pro Val Thr Ser Lys Arg Thr Leu Phe Phe Pro Asp Pro Cys Ser
1 5 10 15

Tyr Asp Thr Pro Pro Pro Asp Cys His Cys His Ser Phe Arg Ala Glu 
20 25 30 

Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 570:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570:

Met Asn Ser Arg Gly Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu
1 5 10 15

Leu His Ile Val Leu Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala
20 25 30

Trp Thr Leu Thr Asn Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu
35 40 45

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ser
50 55 60

Lys Ala Pro Asn Ser Leu Gly Thr Thr Gly Leu Trp Ser Thr Val Tyr
65 70 75 80

Ile I^e Thr Glu Val Phe His Asn Phe Ser Asn Asn Ser Ile Phe Ser
85 90 95

Gly Lys Phe Leu Tyr Glu Val Xaa
100 



(2) INFORMATION FOR SEQ ID NO: 571:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571:

Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu Leu His Ile Val Leu
1 5 10 15

Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala Trp Thr Leu Thr Asn
20 25 30

Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu His Ala Val Lys Gly
35 40 45

Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala Arg Leu Leu Thr His
50 55 60

Trp Glu Gln Leu Asp Tyr Gly Val Gln Phe Thr Ser Ser Arg Lys Phe
65 70 75 80

Phe Thr Ile Ser Pro Ile Ile Leu Tyr Phe Leu Ala Ser Phe Tyr Thr
85 90 95

Lys Tyr Asp Pro Thr His Phe Ile Leu Asn Thr Ala Ser Leu Leu Ser
100 105 110

Val Leu Ile Pro Lys Met Pro Gln Leu His Gly Val Arg Ile Phe Gly
115 120 125

Ile Asn Lys Tyr
130 



(2) INFORMATION FOR SEQ ID NO: 572:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572:

Met Asn Lys Trp Ile Cys Glu Met His Cys Tyr Leu Val Leu Leu Ser
1 5 10 15

Val Cys Ser Pro Ser Ala Leu Arg Arg Val Arg His Thr Leu Ser Arg
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 573:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573:

Met Pro Val Leu Ser Leu Leu Cys Thr Leu Ile Val Ser Phe Gln Ser
1 5 10 15

Ala Asp Ser Cys Glu Val Phe Leu Asn Cys Ser Leu
20 25

(2) INFORMATION FOR SEQ ID NO: 574:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574:

Met Lys Val Ser Thr Met Leu Trp Phe Leu Cys Trp Glu Gln Ser His
1 5 10 15

Phe Leu Arg Glu Trp Glu A^ Leu Ser Thr Phe Leu Ile Leu Ile Gln
20 25 30

Met Glu Cys Gln Tyr Gly Asn Ser
35 40 

(2) INFORMATION FOR SEQ ID NO: 575:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575:

Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser
1 5 10 15

Ala Gly Val Asn Phe Ile Leu Ala Leu Pro Leu Leu Xaa Leu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 576:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576:

Met Lys Arg Gly Cys Leu Gly Leu Leu Phe Phe Ser Cys Cys Ser Ser
1 5 10 15

Ala Pro Thr Met Leu Leu Cys A^ Tyr Leu Asn Trp Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 577:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577:

Met Lys Leu Leu Leu Gly Ile Ala Leu Leu Ala Tyr Val Ala Ser Val
1 5 10 15

Trp Gly Asn Phe Val Asn Met Arg Ser Ile Gln Glu Asn Gly Glu Leu
20 25 30

Lys Ile Glu Ser Lys Ile Glu Glu Met Val Glu Pro Leu Arg Glu Lys
35 40 45

Ile Arg Asp Leu Glu Lys Ser Phe Thr Gln Lys Tyr Pro Pro Val Lys
50 55 60

Phe Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Xaa Asn Arg Arg Arg
65 70 75 80 

Xaa Val Arg Gly Leu Pro Ser Xaa Leu Thr Asn Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 578:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578:

Met Lys Phe Ser Leu Val Leu Leu Ile Lys Ile Ile Ser Phe Glu Arg
1 5 10 15

Leu Leu Ile Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn Ile Trp Arg
20 25 30

Arg Val Met Val Asn Leu Asn Ile Leu Phe
35 40 



(2) INFORMATION FOR SEQ ID NO: 579:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579:

Leu Ala Gln Glu Cys Pro Pro His Ile Pro Ser Ser Phe Phe Leu Val
1 5 10 15

Lys Leu Leu Phe Ile Pro Trp Leu Ala Ser Leu Leu Pro Pro Leu Ser
20 25 30

Thr Phe Thr Ser Asp Phe Tyr Phe Met Glu Phe Gly Ile Glu Val Lys
35 40 45

Leu Gln Gln Cys Arg Gln His Gln Val Leu Gln Glu Lys Asn Thr Lys
50 55 60 



Lys Phe Asn Lys Lys Lys 
65 70 



(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580:

Met Leu Arg Leu Leu Leu Leu Val Ala Phe Ala Leu Val Val Val Leu
1 5 10 15

Phe His Val Leu Leu Ala Pro Ile Thr Ala Leu Phe His Thr His Phe
20 25 30

Tyr Asp Arg Leu Gln Asp Ala Gly Ser Arg Trp Pro Glu Leu Tyr Leu
35 40 45

Tyr Ser Arg Ala Asp Glu Val Val Leu Ala Arg Asp Ile Glu Arg Met
50 55 60

Val Glu Ala Arg Leu Ala Arg Arg Val Leu Ala Arg Ser Val Asp Phe
65 70 75 80

Val Ser Ser Ala His Val Ser His Leu Arg Asp Tyr Pro Thr Tyr Tyr 
85 90 95 

Thr Ser Leu Cys Val Asp Phe Met Arg Asn Cys Val Arg Cys 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 581:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581:

Met Phe Lys Leu Glu Glu Cys Gly Lys Thr Thr Phe L^ Leu Ser Met
1 5 10 15

Ala Leu Tyr Phe Trp Trp Ile Val Gln Thr Thr Lys Gly Cys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 582:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582:

Met Glu Ser Asp Ala Leu Leu Leu Thr Ile Phe Trp Ile Ile Ala Arg
1 5 10 15

Ser Ser Val Arg Ser Val Gly Lys Ser Ser Gln Arg Ser Phe Thr Thr
20 25 30

Ile Thr Gln Leu Arg Ser Thr His Thr Gly Pro Ser Arg Arg Ser Tyr
35 40 45

Leu Ile Trp Trp Asn Gly Gly Pro Lys Arg Thr Ile Ser Tyr Val Ser
50 55 60 

Arg Arg Phe Arg Ser Phe Arg 
65 70 



(2) INFORMATION FOR SEQ ID NO: 583:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583:

Val Gly Leu Phe Gln Pro Lys Thr Phe Gln Val Pro Val Thr Asp Leu
1 5 10 15

Tyr Ile Phe Ile Lys Ile Tyr Ser Glu Ile Gly Pro Ile Met His Val
20 25 30

Leu Cys Pro Gly Tyr Ser Gln Ser Pro Ser Thr Pro Pro Trp Thr
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584:

Met Trp Phe Gly Ser Asp Arg Ser Asp Leu Arg Ile Gly Thr Ala Phe
1 5 10 15

Leu Phe Asp Leu Val Cys Asp Leu Cys Ile His Ala Trp Lys Pro Pro
20 25 30 

Gly Leu Val Arg Phe Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 585:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585:

Met Leu Asn Thr Ala Ser Leu Asn Leu Pro Trp Lys Val Gln Leu Phe
1 5 10 15

Ala His Ala 



(2) INFORMATION FOR SEQ ID NO: 586:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586:

Met Ser Ala Cys Leu Leu Leu Phe Leu Ala Phe Ser Trp Lys Arg Lys
1 5 10 15

Gly Leu Trp Ser Gly Pro Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 587:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587:

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser
1 5 10 15

Leu Leu Pro Val Ile Leu Ala Val Phe Pro Asp Ser Ala Gln Ile Val
20 25 30

Pro Leu Leu Lys Pro Ile Pro Arg Pro Gln Pro Glu Val Ile Phe Pro
35 40 45

Ser Ser Glu Leu Leu Glu Gln Leu Leu Ser Val Gln Phe Val Trp Gln
50 55 60 

Ala His Thr Val Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 588:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588:

Met Gly Pro Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu
1 5 10 15

Leu Met Met Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu
20 25 30

Ser Thr Cys Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu
35 40 45

Asn Val Gly Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr
50 55 60

Arg Lys Lys Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys
65 70 75

(2) INFORMATION FOR SEQ ID NO: 589:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589:

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe
1 5 10 15

Ala Leu Val Gly Leu Ala Lys Leu Ser Glu Glu Ile Ser Ala Pro Val
20 25 30

Ser Glu Arg Met Asn Ala Leu Phe Val Gln Phe Ala Glu Val Phe Pro
35 40 45

Leu Lys Val Phe Gly Tyr Gln Pro Asp Pro Leu Asn Tyr Gln Ile Ala
50 55 60

Val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro
65 70 75 80

Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu Leu Met Met
85 90 95

Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys
100 105 110

Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly
115 120 125

Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys
130 135 140

Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 590:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590:

Met Pro Glu Thr Arg Leu Gly His Arg Gln Gln Phe Ala Val Phe His
1 5 10 15



Leu Xaa Pro Val Pro Pro Cys Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 591:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591:

Met Leu Thr Phe Leu Phe Ser Ala Cys Ala Thr Cys Leu Gly Lys Leu
1 5 10 15

Ala Ser Pro Leu Ala Pro Val Gly Pro Gln Gln Arg Gly Xaa Pro Pro
20 25 30



Gly Pro Pro Leu Leu Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 592:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592:

Met Asp Pro Phe His Tyr Asp Tyr Gln Thr Leu Arg Ile Gly Gly Leu
1 5 10 15

Val Phe Ala Val Val Leu Phe Ser Val Gly Ile Leu Leu Ile Leu Ser
20 25 30

Arg Arg Cys Lys Cys Ser Phe Asn Gln Lys Pro Arg Ala Pro Gly Asp
35 40 45

Glu Glu Ala Gln Val Glu Asn Leu Ile Thr Ala Asn Ala Thr Glu Pro
50 55 60

Gln Lys Ala Glu Asn
65 



(2) INFORMATION FOR SEQ ID NO: 593:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 308 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593:

Asn Leu Arg Val Arg Leu Gly Asp Val Ile Ser Ile Gln Pro Cys Pro
1 5 10 15

Asp Val Lys Tyr Gly Lys Arg Ile His Val Leu Pro Ile Asp Asp Thr
20 25 30

Val Glu Gly Ile Thr Gly Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr
35 40 45

Phe Leu Glu Ala Tyr Arg Pro Ile Arg Lys Gly Asp Ile Phe Leu Val
50 55 60

Arg Gly Gly Met Arg Ala Val Glu Phe Lys Val Val Glu Thr Asp Pro
65 70 75 80

Ser Pro Tyr Cys Ile Val Ala Pro Asp Thr Val Ile His Cys Glu Gly
85 90 95

Glu Pro Ile Lys Arg Glu Asp Glu Glu Glu Ser Leu Asn Glu Val Gly
100 105 110

Tyr Asp Asp Ile Gly Gly Cys Arg Lys Gln Leu Ala Gln Ile Lys Glu
115 120 125

Met Val Glu Leu Pro Leu Arg His Pro Ala Leu Phe Lys Ala Ile Gly
130 135 140

Val Lys Pro Pro Arg Gly Ile Leu Leu Tyr Gly Pro Pro Gly Thr Gly
145 150 155 160

Lys Thr Leu Ile Ala Arg Ala Val Ala Asn Glu Thr Gly Ala Phe Phe
165 170 175

Phe Leu Ile Asn Gly Pro Glu Ile Met Ser Lys Leu Ala Gly Glu Ser
180 185 190

Glu Ser Asn Leu Arg Lys Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro
195 200 205

Ala Ile Ile Phe Ile Asp Glu Leu Asp Ala Ile Ala Pro Lys Arg Glu
210 215 220

Lys Thr His Gly Glu Val Glu Arg Arg Ile Val Ser Gln Leu Leu Thr
225 230 235 240

Leu Met Asp Gly Leu Lys Gln Arg Ala His Val Ile Val Met Ala Ala
245 250 255

Thr Asn Arg Pro Asn Ser Ile Asp Pro Ala Leu Arg Arg Phe Gly Arg
260 265 270

Phe Asp Arg Glu Val Asp Ile Gly Ile Pro Asp Ala Thr Gly Arg Leu
275 280 285

Glu Ile Leu Gln Ile His Thr Lys Asn Met Lys Leu Ala Asp Asp Val
290 295 300

Asp Leu Glu Gln
305 



(2) INFORMATION FOR SEQ ID NO: 594:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594:

Met Gln Ile Lys Leu Leu Lys Ser Val Lys Thr Val Phe Ala Ile Thr
1 5 10 15

Leu Leu Val Leu Phe Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 595:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595:

Met Phe Pro Lys Phe Cys Pro Ile Leu Ser Leu Val Asp Phe Ile Ser
1 5 10 15 

His Arg Asp Lys Pro Glu Thr Glu 
20 



(2) INFORMATION FOR SEQ ID NO: 596:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:

Met Leu Ile Glu Cys Ala Trp Gln Leu Met Phe Leu Leu Leu Lys Val
1 5 10 15

Glu Gln Leu Gly Ile Leu Asp Lys
20 



(2) INFORMATION FOR SEQ ID NO: 597:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:

Met
1

(2) INFORMATION FOR SEQ ID NO: 598:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598:

Met Cys Ile Met Ser Ala Leu Val
1 5



(2) INFORMATION FOR SEQ ID NO: 599:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599:

Met Phe Leu Val Trp Phe Phe Trp Gly Leu Ile Ser Ala Leu Ser Asn
1 5 10 15

Val His Thr Pro Ser Arg Leu Pro Ala
20 25



(2) INFORMATION FOR SEQ ID NO: 600:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:

Met Xaa Gly Leu Ser Leu Ile Leu Thr Val Thr Leu Leu Ala Val Ser
1 5 10 15

A^ Ser Ala Ala Thr Cys Ile Val Ala Lys Gly
20 25 



(2) INFORMATION FOR SEQ ID NO: 601:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids






(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:

Met Trp Thr Arg Ser Ser Arg Cys Leu Leu Leu Cys Ile Pro Gly Xaa
1 5 10 15

Ser Arg Arg Arg Arg Ala Gly Ser Gly Met Lys Pro Arg Ser Trp Ser
20 25 30

Ala Trp Arg Pro Ser Gly Gly Thr Gly Thr Ser Ser Ser Gln Ser Ser
35 40 45

Thr Gln Ser Arg Thr Leu Ser Ala Thr Ala Ser Pro Ala
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 602:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:

Met Arg Glu Thr Ser Ile Arg Val Leu Leu Met Leu Pro Ala Leu Glu
1 5 10 15

Ser Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr
20 25 



(2) INFORMATION FOR SEQ ID NO: 603:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:

Met Pro Pro Lys Gln Glu Leu Gly Ser Gly Val Gly Glu Leu Ala Lys
1 5 10 15

Asn Ser Lys Arg Gln His Trp Asn His Arg Trp Lys Lys Tyr Leu Lys
20 25 30

Leu Ile Arg Trp Glu Asp Gly Leu Leu Leu Glu Gly Leu Leu Leu Val
35 40 45

Leu Glu His Cys Ala Thr Met Ala Trp Asp Cys Leu Met Arg Leu Glu
50 55 60

Leu Leu Lys Arg Leu
65



(2) INFORMATION FOR SEQ ID NO: 604:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604:

Lys Ile Val Tyr Ile Leu Gly Asn Pro Leu Lys Phe Asn Ser Arg Val
1 5 10 15

Ile His His Leu Val Leu Leu Gln
20



(2) INFORMATION FOR SEQ ID NO: 605:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605:

Met Asn Leu His Gln Arg Arg Leu Leu Leu Ile Gly His Leu Met Thr
1 5 10 15

Leu Val Lys Ala Ser Lys Ser Phe Ser Phe Thr Glu Ile Thr Ser Ser
20 25 30

Arg Lys Lys
35 



(2) INFORMATION FOR SEQ ID NO: 606:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 130 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606:

Leu Leu Gly Tyr Gly Leu Phe Gly His Cys Ile Val Leu Phe Ile Thr
1 5 10 15

Tyr Asn Ile His Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val
20 25 30

Gly Gly Leu Ser Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr
35 40 45

Val Gly Pro Thr Gln Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu
50 55 60

His Met Leu Phe Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Xaa
65 70 75 80

Glu Gly Ile Leu Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro Ile Gln
85 90 95

Arg Val Pro Arg Asp Ile Pro Ala Met Leu Pro Ala Ala Arg Leu Pro
100 105 110

Thr Thr Val Leu Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gln
115 120 125

Ser His
130 



(2) INFORMATION FOR SEQ ID NO: 607:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607:

Met Leu Val Ile Phe Leu Phe Thr Ser Leu Leu Lys Ile Pro Ser Ser
1 5 10 15

Val Pro Gly Leu Ile Asn Val
20 



(2) INFORMATION FOR SEQ ID NO: 608:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608:

Glu Leu Asp Tyr Ile Leu
1 5 



(2) INFORMATION FOR SEQ ID NO: 609:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

Met Ala Pro Pro Gly Trp Gln Xaa Xaa Xaa Xaa Xaa Trp Leu Ala Cys
1 5 10 15

Pro Asp Arg Gly Glu Leu Ser Ser Arg Ser Pro Pro Cys Arg Leu Ala
20 25 30

Arg Trp Ala Glu Gly Asp Arg Glu Thr Arg Thr Cys Leu Leu Glu Leu
35 40 45

Ser Ala Gln Ser Trp Gly Gly Arg Phe Arg Arg Ser Ser Ala Val Ser
50 55 60

Ala Gly Ser Pro Ser Arg Leu His Phe Leu Pro Gln Pro Leu Leu Leu
65 70 75 80

Arg Ser Ser Gly Ile Pro Ala Ala Ala Thr Pro Trp Pro Gln Pro Ala
85 90 95

Gly Leu Pro Val Arg Pro Thr Pro Thr Arg Thr Gly Glu Glu Asp Arg
100 105 110

Thr Leu Asp Ile Ser Ile Cys Thr Glu Val Leu Ala Gly Thr Glu Gln
115 120 125

Pro Pro Pro Pro Arg Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg
130 135 140

Leu Glu Thr Leu Asp Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg
145 150 155 160

Gly Lys Leu Asp Phe Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe
165 170 175

Gln Gly Glu Asp Arg Lys Phe Ala Pro Ser Asp Lys Ser Gln Pro Pro
180 185 190

Thr Thr Glu Arg Glu Gln Val Pro Val Ser Arg Ile Gln Thr Asp Leu
195 200 205

Thr Glu Ile Gly Ser Ser Met Arg Ser Pro Gly Val Ser Pro Arg Ile
210 215 220

Trp Leu Asp Phe Gln Ser Thr Xaa
225 230 



(2) INFORMATION FOR SEQ ID NO: 610:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610:

Met Val Leu Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Ile Leu
1 5 10 15

Leu Leu Asn Met Leu Ile Ala Leu Met Xaa Arg Asp Arg Gln Gln Cys
20 25 30

Arg His 



(2) INFORMATION FOR SEQ ID NO: 611:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611:

Met Val Phe Glu Gly Phe Ser Ser Ala Phe Cys Leu Ser Ser Thr Ala
1 5 10 15

Pro Thr Ser His Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 612:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612:

Gly Lys Lys Asn Gln Leu Leu Val Ile
1 5 



(2) INFORMATION FOR SEQ ID NO: 613:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613:

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val
1 5 10 15

Gly Pro Cys Trp Pro Ser Oto Gly Asn Cys Cys Leu Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614:

Met Ala Lys Arg Ser Pro Gly Gly Cys Gly Ser Gly Leu Ile Leu Leu
1 5 10 15

Cys Cys Gln Pro Cys Arg Pro Thr Ser Ser Ala Pro Met Arg
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 615: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615:

Ile Thr Ile Ala Ile Gln Met Ile Cys Leu Val Asn Xaa Glu Leu Tyr
1 5 10 15

Pro Thr Phe Val Arg Asn Xaa Gly Val Met Val Cys Ser Ser Leu Cys
20 25 30

Asp Ile Gly Gly Ile Ile Thr Pro Phe Ile Val Phe Arg Leu Arg Glu
35 40 45

Val Trp Gln Ala Leu Pro Leu Ile Leu Phe Ala Val Leu Gly Leu Leu
50 55 60

Ala Ala Gly Val Thr Leu Leu Leu Pro Glu Thr Lys Gly Val Ala Leu
65 70 75 80

Pro Glu Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys Pro
85 90 95

Lys Glu Asn Thr Ile Tyr Leu Lys Val Gln Thr Ser Glu Pro Ser Gly
100 105 110

Thr



(2) INFORMATION FOR SEQ ID NO: 616:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:

Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys Pro Lys Glu
1 5 10 15

Asn Thr



(2) INFORMATION FOR SEQ ID NO: 617:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617:

Pro Arg Val Arg Asn Ser Pro Glu Asp Leu Gly Leu Ser Leu Thr Gly
1 5 10 15

Asp Ser Cys Lys Leu
20



(2) INFORMATION FOR SEQ ID NO: 618:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618:

Gln Ala Asp Asp Leu Gln Ala Thr Val Ala Ala Leu Cys Val Leu Arg
1 5 10 15

Gly Gly Gly Pro Trp Ala Gly Ser Trp Leu Ser Pro Lys Thr Pro Gly
20 25 30

Ala Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg
35 40 45

Lys Arg Leu Leu
50



(2) INFORMATION FOR SEQ ID NO: 619:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619:

Glu Gln Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu Ala Xaa Xaa
1 5 10 15

Gly Ile Gly Leu Met Val Leu His Ala Glu Met Leu Trp Phe Gly Gly
20 25 30

Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr Leu Trp Leu
35 40 45

Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr Gly Asp Val Val Pro Gly
50 55 60

Thr Met Trp Gly Lys Ile Val Cys Leu Cys Thr Gly Val Met Gly Val
65 70 75 80

Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys Leu Glu Phe
85 90 95

Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp Ile Gln Tyr
100 105 110

Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gln Glu Ala Trp
115 120 125

Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala Ala Arg Xaa
130 135 140

His Gln Arg Xaa Leu Leu Ala Ala Ile Asn Ala Phe Arg Gln Val Arg
145 150 155 160

Leu Lys His Arg Lys Leu Arg Glu Gln Val Asn Ser Met Val Asp Ile
165 170 175

Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln Asn Leu Ser Ser
180 185 190

Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu Ala Gly Lys Leu
195 200 205

Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro Arg Gln Leu
210 215 220

Pro Glu Pro Ser Gln Gln Ser Lys
225 230



(2) INFORMATION FOR SEQ ID NO: 620:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620:

Tyr Gln Ala His His Val Ser Arg Asn Lys Arg Gly Gln Val Val Gly
1 5 10 15

Thr Arg Gly Gly Phe Arg Gly Cys Thr Val Trp Leu Thr Gly Leu Ser
20 25 30

Gly Ala Gly Lys
35

(2) INFORMATION FOR SEQ ID NO: 621:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 57 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621:

Leu Gln Cys Glu Ile Cys Gly Phe Thr Cys Arg Gln Lys Ala Ser Leu
1 5 10 15

Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gln Phe Ser
20 25 30

Cys Asn Ile Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala
35 40 45

His Lys Ala Lys Ser His Pro Glu Val
50 55



(2) INFORMATION FOR SEQ ID NO: 622:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:

Ile Thr Ser Thr Asp Ile Leu Gly Thr Asn Pro Glu Ser Leu Thr Gln
1 5 10 15

Pro Ser Asp



(2) INFORMATION FOR SEQ ID NO: 623:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623:

Asn Ser Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser
1 5 10 15

Lys Ser Tyr



(2) INFORMATION FOR SEQ ID NO: 624:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624:

Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys Ile Phe
1 5 10 15

Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met Asn Ser
20 25 30

Asp Ile Leu Gly Ala Thr Thr Glu Val Leu Ile Glu Asp Ser Asp Ser
35 40 45

Ala Gly Pro
50



(2) INFORMATION FOR SEQ ID NO: 625:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625:

Ile Gln Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala
1 5 10 15

His Pro Arg Tyr Leu Gln His His Ile Lys Tyr Gln His Leu Leu Lys
20 25 30

Lys Lys Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu
35 40 45

Gln Lys Gln Leu Leu Arg His Ala Lys His His Thr
50 55 60



(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:

Asp Gln Arg Asp Tyr Ile Cys Glu Tyr Cys Ala Arg Ala Phe Lys Ser
1 5 10 15

Ser His Asn Leu Ala Val His Arg Met Ile His Thr Gly Glu Lys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:

Arg Ser Ser Arg Ser Lys Thr Gly Ser Leu Gln Leu Ile Cys Lys Ser
1 5 10 15

Glu Pro Asn Thr Asp Gln Leu Asp Tyr
20 25



(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:

Leu Gln Cys Glu Ile Cys Gly Phe Thr Cys Arg Gln Lys Ala Ser Leu
1 5 10 15

Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gln Phe Ser
20 25 30

Cys Asn Ile Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala
35 40 45

His Lys Ala Lys Ser His Pro Glu Val Xaa Ile Thr Ser Thr Asp Ile
50 55 60

Leu Gly Thr Asn Pro Glu Ser Leu Thr Gln Pro Ser Asp Xaa Asn Ser
65 70 75 80

Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser Lys Ser
85 90 95

Tyr Xaa Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys
100 105 110

Ile Phe Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met
115 120 125

Asn Ser Asp Ile Leu Gly Ala Thr Thr Glu Val Leu Ile Glu Asp Ser
130 135 140

Asp Ser Ala Gly Pro Xaa Gln Arg Asp Tyr Ile Cys Glu Tyr Cys Ala
145 150 155 160

Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met Ile His
165 170 175

Thr Gly Glu Lys His Tyr Xaa
180



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629:

Gln Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His
1 5 10 15

Pro Arg Tyr Leu Gln His His Ile Lys Tyr Gln His Leu Leu Lys Lys
20 25 30

Lys Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gln
35 40 45

Lys Gln Leu Leu Arg His Ala Lys His His Thr Asp
50 55 60



(2) INFORMATION FOR SEQ ID NO: 630:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:

Pro Phe Lys Asp Asp Pro Arg Asp Glu Thr Tyr Lys Pro His Leu Glu
1 5 10 15

Arg Glu Thr Pro Lys Pro Arg Arg Lys Ser Gly
20 25



(2) INFORMATION FOR SEQ ID NO: 631:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631:

Glu Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu
1 5 10 15

Met Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gln His Ala Val Val
20 25 30

Leu Tyr Leu His Asn His Ser Tyr Ala Ser Ile Gln Ser Asp Asp Leu
35 40 45

Trp Asp Ser Phe Asn Glu Val Thr Asn Gln Thr Leu Asp Val Lys Arg
50 55 60

Met Met Lys Thr Trp Thr Leu Gln Lys Gly Phe Pro Leu Val Thr Val
65 70 75 80

Gln Lys Lys Gly Lys Glu Leu Phe Ile Gln Gln Glu Arg Phe Phe Leu
85 90 95

Asn Met Lys Pro Glu Ile Gln Pro Ser Asp Thr Arg Tyr Met
100 105 110



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632:

Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg Pro Thr Gly Gln Gln
1 5 10 15

Leu Glu Ser Leu Gly Leu Leu Ala
20

(2) INFORMATION FOR SEQ ID NO: 633:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:

Val His Arg Glu Glu Ala Ser Cys Tyr Cys Gln Ala Glu Pro Ser Gly
1 5 10 15

Asp Leu



(2) INFORMATION FOR SEQ ID NO: 634:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634:

Arg Pro Ala Leu Arg Gln Ala Gly Gly Gly Thr Arg Glu Pro Arg Gln
1 5 10 15

Lys Arg Trp Ala Gly Leu
20



(2) INFORMATION FOR SEQ ID NO: 635:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635:

Ala Val Asn Phe Arg Pro Gln Arg Ser Gln Ser Met
1 5 10



(2) INFORMATION FOR SEQ ID NO: 636:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636:

Met Ile Thr Asp Val Gln Leu Ala Ile Phe Ala Asn Met Leu Gly Val
1 5 10 15

Ser Leu Phe Leu Leu Val Val Leu Tyr His Tyr Val Ala Val Asn Asn
20 25 30

Pro Lys Lys Gln Glu
35

(2) INFORMATION FOR SEQ ID NO: 637:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 342 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637:

Glu Glu Met Ala Asp Ser Val Lys Thr Phe Leu Gln Leu Ala Arg
1 5 10 15

Gly Ile Lys Asp Ser Ile Trp Gly Ile Cys Thr Ile Ser Lys Leu Asp
20 25 30

Ala Arg Ile Gln Gln Lys Arg Glu Glu Gln Arg Arg Arg Arg Ala Ser
35 40 45

Ser Val Leu Ala Gln Arg Arg Ala Gln Ser Ile Glu Arg Lys Gln Glu
50 55 60

Ser Glu Pro Arg Ile Val Ser Arg Ile Phe Gln Cys Cys Ala Trp Asn
65 70 75 80

Gly Gly Val Phe Trp Phe Ser Leu Leu Leu Phe Tyr Arg Val Phe Ile
85 90 95

Pro Val Leu Gln Ser Val Thr Ala Arg Ile Ile Gly Asp Pro Ser Leu
100 105 110

His Gly Asp Val Trp Ser Trp Leu Glu Phe Phe Leu Thr Ser Ile Phe
115 120 125

Ser Ala Leu Trp Val Leu Pro Leu Phe Val Leu Ser Lys Val Val Asn
130 135 140

Ala Ile Trp Phe Gln Asp Ile Ala Asp Leu Ala Phe Glu Val Ser Gly
145 150 155 160

Arg Lys Pro His Pro Phe Pro Ser Val Ser Lys Ile Ile Ala Asp Met
165 170 175

Leu Phe Asn Leu Leu Leu Gln Ala Leu Phe Leu Ile Gln Gly Met Phe
180 185 190

Val Ser Leu Phe Pro Ile His Leu Val Gly Gln Leu Val Ser Leu Leu
195 200 205

His Met Ser Leu Leu Tyr Ser Leu Tyr Cys Phe Glu Tyr Arg Trp Phe
210 215 220

Asn Lys Gly Ile Glu Met His Gln Arg Leu Ser Asn Ile Glu Arg Asn
225 230 235 240

Trp Pro Tyr Tyr Phe Gly Phe Gly Leu Pro Leu Ala Phe Leu Thr Ala
245 250 255

Met Gln Ser Ser Tyr Ile Ile Ser Gly Cys Leu Phe Ser Ile Leu Phe
260 265 270

Pro Leu Phe Ile Ile Ser Ala Asn Glu Ala Lys Thr Pro Gly Lys Ala
275 280 285

Tyr Leu Phe Gln Leu Arg Leu Phe Ser Leu Val Val Phe Leu Ser Asn
290 295 300

Arg Leu Phe His Lys Thr Val Tyr Leu Gln Ser Ala Leu Ser Ser Ser
305 310 315 320

Thr Ser Ala Glu Lys Phe Pro Ser Pro His Pro Ser Pro Ala Lys Leu
325 330 335

Lys Ala Thr Ala Gly His
340



(2) INFORMATION FOR SEQ ID NO: 638:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638:

Met Ala Lys Phe Met Thr Pro Val Ile Gln Asp Asn Pro Ser Gly Trp
1 5 10 15

Gly Pro Cys Ala Val Pro Glu Gln Phe Arg Asp Met Pro Tyr Gln Pro
20 25 30

Phe Ser Lys Gly Asp Arg Leu Gly Lys Val Ala Asp Trp Thr Gly Ala
35 40 45

Thr Tyr Gln Asp Lys Arg Tyr Thr Asn Lys Tyr Ser Ser Gln Phe Gly
50 55 60

Gly Gly Ser Gln Tyr Ala Tyr Phe His Glu Glu Asp Glu Ser Ser Phe
65 70 75 80

Gln Leu Val Asp Ttao Ala Arg Thr Gln Lys Ote Ala Tyr Gln Arg Asn
85 90 95

Arg Met Arg Phe Ala Gln Arg Asn Leu Arg Arg Asp Lys Asp Arg Arg
100 105 110

Asn Met Leu Gln Phe Asn Leu Gln Ile Leu Pro Lys Ser Ala Lys Gln
115 120 125

Lys Glu Arg Glu Arg Ile Arg Leu Gln Lys Lys Phe Gln Lys Gln Phe
130 135 140

Gly Val Arg Gln Lys Trp Asp Gln Lys Ser Gln Lys Pro Arg Asp Ser
145 150 155 160

Ser Val Glu Val Arg Ser Asp Trp Glu Val Lys Glu Glu Met Asp Phe
165 170 175

Pro Gln Leu Met Lys Met Arg Tyr Leu Glu Val Ser Glu Pro Gln Asp
180 185 190

Ile Glu Cys Cys Gly Ala Leu Glu Tyr Tyr Asp Lys Ala Phe Asp Arg
195 200 205

Ile Thr Tte Arg Ser Glu Lys Pro Leu Arg Xaa Xaa Lys Arg Ile Phe
210 215 220

His Thr Val Thr Thr Thr Asp Asp Pro Val Ile Arg Lys Leu Ala Lys
225 230 235 240

Thr Gln Gly Asn Val Phe Ala Thr A^ Ala Ile Leu Ala Thr Leu Met
245 250 255

Ser Cys Thr Arg Ser Val Tyr Ser Trp Asp Ile Val Val Gln Arg Val
260 265 270

Gly Ser Lys Leu Phe Phe Asp Lys Arg Asp Asn Ser Asp Phe Asp Leu
275 280 285

Leu Thr Val Ser Glu Thr Ala Asn Glu Pro Pro Gln Asp Glu Gly Asn
290 295 300

Ser Phe Asn Ser Pro Arg Asn Leu Ala Met Glu Ala Thr Tyr Ile Asn
305 310 315 320

His Asn Phe Ser Gln Gln Cys Leu Arg Met Gly Lys Glu Arg Tyr Asn
325 330 335

Phe Pro Asn Pro Asn Pro Phe Val Glu Asp Asp Met Asp Lys Asn Glu
340 345 350

Ile Ala Ser Val Ala Tyr Arg Tyr Arg Ser Gly Lys Leu Gly Asp Asp
355 360 365

Ile Asp Leu Ile Val Arg Cys Glu His Asp Gly Val Met Thr Gly Ala
370 375 380

Asn Gly Glu Val Ser Phe Ile Asn Ile Lys Thr Leu Asn Glu Trp Asp
385 390 395 400

Ser Arg His Cys Asn Gly Val Asp Trp Arg Gln Lys Leu Asp Ser Gln
405 410 415

Arg Gly Ala Val Ile Ala Thr Glu Leu Lys Asn Asn Ser Tyr Lys Leu
420 425 430

Ala Arg Trp Thr Cys Cys Ala Leu Leu Ala Gly Ser Glu Tyr Leu Lys
435 440 445

Leu Gly Tyr Val Ser Arg Tyr His Val Lys Asp Ser Ser Arg His Val
450 455 460

Ile Leu Gly Thr Gln Gln Phe Lys Pro Asn Glu Phe Ala Ser Gln Ile
465 470 475 480

Asn Leu Ser Val Glu Asn Ala Trp Gly Ile Leu Arg Cys Val Ile Asp
485 490 495

Ile Cys Met Lys Glu Glu Gly Lys Tyr Leu Ile Leu Lys Asp Pro
500 505 510

Asn Lys Gln Val Ile Arg Val Tyr Ser Leu Pro Asp Gly Thr Phe Ser
515 520 525

Ser

(2) INFORMATION FOR SEQ ID NO: 639:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 194 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639:

Lys Lys Arg His Thr Asp Val Gln Phe Tyr Thr Glu Val Gly Glu Ile
1 5 10 15

Thr Thr Asp Leu Gly Lys His Gln His Met His Asp Arg Asp Asp Leu
20 25 30

Tyr Ala Glu Gln Met Glu Arg Glu Met Arg His Lys Leu Lys Thr Ala
35 40 45

Phe Lys Asn Phe Ile Glu Lys Val Glu Ala Leu Thr Lys Glu Glu Leu
50 55 60

Glu Phe Glu Val Pro Phe Arg Asp Leu Gly Phe Asn Gly Ala Pro Tyr
65 70 75 80

Arg Ser Thr Cys Leu Leu Gln Pro Thr Ser Ser Ala Leu Val Asn Ala
85 90 95

Thr Glu Trp Pro Pro Phe Val Val Thr Leu Asp Glu Val Glu Leu Ile
100 105 110

His Phe Xaa Arg Val Gln Phe His Leu Lys Asn Phe Asp Met Val Ile
115 120 125

Val Tyr Lys Asp Tyr Ser Lys Lys Val Thr Met Ile Asn Ala Ile Pro
130 135 140

Val Ala Ser Leu Asp Pro Ile Lys Glu Trp Lai Asn Ser Cys Asp Leu
145 150 155 160

Lys Tyr Thr Glu Gly Val Gln Ser Leu Asn Trp Thr Lys Ile Met Lys
165 170 175

Thr Ile Val Asp Asp Pro Glu Gly Phe Phe Glu Gln Gly Gly Trp Ser
180 185 190

Phe Leu



(2) INFORMATION FOR SEQ ID NO: 640:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640:

Arg Ser Gly Leu Gly Leu Gly Ile Thr Ile Ala Phe Leu Ala Thr Leu
1 5 10 15

Ile Thr Gln Phe Leu Val Tyr Asn Gly Val Tyr Gln Tyr Thr Ser Pro
20 25 30

Asp Phe Leu Tyr Ile Arg Ser Trp Leu Pro Cys Ile Phe Phe Ser Gly
35 40 45

Gly Val Thr Val Gly Asn Ile Gly Arg Gln Leu Ala Met Gly Val Pro
50 55 60

Glu Lys Pro His Ser Asp
65 70



(2) INFORMATION FOR SEQ ID NO: 641:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 101 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641:

Val Thr Gln Pro Lys His Leu Ser Ala Ser Met Gly Gly Ser Val Glu
1 5 10 15

Ile Pro Phe Ser Phe Tyr Tyr Pro Trp Glu Leu Ala Xaa Xaa Pro Xaa
20 25 30

Val Arg Ile Ser Trp Arg Arg Gly His Phe His Gly Gln Ser Phe Tyr
35 40 45

Ser Thr Arg Pro Pro Ser Ile His Lys Asp Tyr Val Asn Arg Leu Phe
50 55 60

Leu Asn Trp Thr Glu Gly Gln Glu Ser Gly Phe Leu Arg Ile Ser Asn
65 70 75 80

heM Arg Lys Glu Asp Gln Ser Val Tyr Phe Cys Arg Val Glu Leu Asp
85 90 95

Thr Arg Arg Ser Gly
100



(2) INFORMATION FOR SEQ ID NO: 642:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642:

Met Glu Ala Gln Gln Val Asn Glu Ala Glu Ser Ala Arg Glu Gln Leu
1 5 10 15

Gln Xaa Leu His Asp Gln Ile Ala Gly Gln Lys Ala Ser Lys Gln Glu
20 25 30

Leu Glu Thr Glu Leu Glu Arg Leu Lys Gln Glu Phe His Tyr Ile Glu
35 40 45

Glu Asp Leu Tyr Arg Thr Lys Asn Thr Leu Gln Ser Arg Ile Lys Asp
50 55 60

Arg Asp Glu Glu Ile Gln Lys Leu Arg Asn Gln Leu Thr Asn Lys Thr
65 70 75 80

Leu Ser Asn Ser Ser Gln Ser Glu Leu Glu Asn Arg Leu His Gln Leu
85 90 95

Thr Glu Thr Leu Ile Gln Lys Gln Thr Met Leu Glu Ser Leu Ser Thr
100 105 110

Glu Lys Asn Ser Leu Val Phe Gln Leu Glu Arg Leu Glu Gln Gln Met
115 120 125

Asn Ser Ala Ser Gly Ser Ser Ser Asn Gly Ser Ser Ile Asn Met Ser
130 135 140

Gly Ile Asp Asn Gly Glu Gly Thr Arg Leu Arg Asn Val Pro Val Leu
145 150 155 160

Phe Asn Asp Thr Glu Thr Asn Leu Ala Gly Met Tyr Gly Lys Val Arg
165 170 175

Lys Ala Ala Ser Ser Ile Asp Gln Phe Ser Ile Arg Leu Gly Ile Phe
180 185 190

Leu Arg Arg Tyr Pro Ile Ala Arg Val Phe Val Ile Ile Tyr Met Ala
195 200 205

Leu Leu His Leu Trp Val Met Ile Val Leu Leu Thr Tyr Thr Pro Glu
210 215 220

Met His His Asp Gln Pro Tyr Gly Lys
225 230



(2) INFORMATION FOR SEQ ID NO: 643:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643:

Ile Arg His Glu Gln His Pro Asn Ete Ser Leu Glu Met His Ser Lys
1 5 10 15

Gly Ser Ser Leu Leu Leu Phe Leu Pro Gln Ile Leu Ile Leu Pro
20 25 30

Val Cys Ala His Leu His Glu Glu Leu Asn Cys
35 40



(2) INFORMATION FOR SEQ ID NO: 644:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644:

Ser Phe Phe Ile Ser Glu Glu Lys Gly His Leu Leu Leu Gln Ala Glu
1 5 10 15

Arg His Pro Trp Val Ala Gly Ala Leu Val Gly Val Ser Gly Gly Leu
20 25 30

Thr Leu Thr Thr Cys Ser Gly Pro Thr Glu Lys Pro Ala Thr Lys Asn
35 40 45

Tyr Phe Leu Lys Arg Leu Leu Gln Glu Met His Ile Arg Ala Asn
50 55 60

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 116, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997

Accession Number 97897

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 116, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997

Accession Number 209043

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 119, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit September 4, 1997

Accession Number 209235

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White
PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 1^, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997

Accession Number 97898

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 122, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997

Accession Number 209044

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 126, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997

Accession Number 97899

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 126, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997

Accession Number 209045

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)


INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 130, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

g., "Accession

For receiving Office use only

This sheet was received with the international application

Authorized officer

White

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)






INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 131, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997

Accession Number 97900

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

PCT International Division

For International Bureau use only

□ This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)






INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997



Accession Number 97901 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□

This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 131, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997


Accession Number 209046 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



This sheet was received with the international application 



Authorized officer 



Susan White

PCT International Division

For International Bureau use only

□

This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)






INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 


Accession Number 209047 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only

This sheet was received with the international application

Authorized officer Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 22, 1997


Accession Number 209076 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



0™- 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 140, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit August 21, 1997

Accession Number 209215



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 160, line N/A

B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997


Accession Number 97904 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only

This sheet was received with the international application

Authorized officer Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description

on page 154, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution 

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit July 3, 1997


Accession Number 209139 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer



Form PCT/RO/134 (July 1992) 



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997



Accession Number 209049 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America 



Date of deposit February 26, 1997 



Accession Number 97903 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 142, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit June 12, 1997 



Accession Number 209119



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



0r 



This sheet was received with the international application

Authorized officer

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 146, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97902 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only ■ 



EC 



This sheet was received with the international application

Authorized officer

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 146, line N/A

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit May 15, 1997



Accession Number 209048 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



[3" 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 160, line N/A

B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet 



□ 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America 



Date of deposit May 15, 1997 


Accession Number 209050 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 142, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 12, 1998



Accession Number 209627 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving OfHce use only . 



w 



This sheet was received with the international application 



Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a
nucleotide sequence at least 95% identical to a sequence selected from the group
consisting of:

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID
NO:X;

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X,
having biological activity;

(f) a polynucleotide which is a variant of SEQ ID NO:X;

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X;

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y;

(i) a polynucleotide capable of hybridizing under stringent conditions to any
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide
sequence of only A residues or of only T residues.

2. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein.

3. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding the sequence
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence included
in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X.
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4 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID
NO:X.

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

7. A recombinant vector comprising the isolated nucleic acid molecule of
claim 1.

8. A method of making a recombinant host cell comprising the isolated
nucleic acid molecule of claim 1.

9. A recombinant host cell produced by the method of claim 8.

10. The recombinant host cell of claim 9 comprising vector sequences.

11. An isolated polypeptide comprising an amino acid sequence at least 95%
identical to a sequence selected from the group consisting of:

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z;

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z, having biological activity;

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(f) a full length protein of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z; 
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(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO:Y; or

(i) a species homologue of the SEQ ID NO:Y.

12. The isolated polypeptide of claim 11, wherein the secreted form or the
full length protein comprises sequential amino acid deletions from either the C-terminus
or the N-terminus.

13. An isolated antibody that binds specifically to the isolated polypeptide of
claim 11.

14. A recombinant host cell that expresses the isolated polypeptide of claim 11.

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15.

17. A method for preventing, treating, or ameliorating a medical condition,
comprising administering to a mammalian subject a therapeutically effective amount of
the polypeptide of claim 11 or the polynucleotide of claim 1.

18. A method of diagnosing a pathological condition or a susceptibility to a
pathological condition in a subject comprising:

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of
claim 11 in a biological sample; and

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 
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20. A method for identifying a binding partner to the polypeptide of claim 1 1 
comprising: 

(a) contacting the polypeptide of claim 11 with a binding partner; and

(b) determining whether the binding partner effects an activity of the
polypeptide.

21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y.

22. A method of identifying an activity in a biological assay, wherein the
method comprises:

(a) expressing SEQ ID NO:X in a cell;

(b) isolating the supernatant;

(c) detecting an activity in a biological assay; and

(d) identifying the protein in the supernatant having the activity.

23. The product produced by the method of claim 22.



International application No. Unassigned



. 03'APRfiS8 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 160, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet



IS 



Name of depositary institution American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97904 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

In respect of those designations in which a European Patent is sought, a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which the
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28(4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")



For receiving Office use only . 



This sheet was received with the international application

Authorized officer

For International Bureau use only

□

This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)



PCT/US98/04493



CANADA 

The applicant requests that, until either a Canadian patent has been issued
on the basis of an application or the application has been refused, or is
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner. The applicant must, by
a written statement, inform the International Bureau accordingly before
completion of technical preparations for publication of the international
application.

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a
microorganism shall only be effected prior to the grant of a patent, or prior
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of
the Australian Patents Regulations).

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the
furnishing of a sample shall only be effected to an expert in the art.
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UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally
decided upon by the Swedish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 

International application No. Unassigned 

PCT/US98/04493 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 116, line N/A 

B. IDENTIFICATION OF DEPOSIT 

Further deposits are identified on an additional sheet 

Name of depositary institution American Type Culture Collection 

Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 

Date of deposit May 15, 1997 

Accession Number 209043 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 

In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only 

This sheet was received with the international application 

Authorized officer Susan White 

PCT International Division 

For International Bureau use only 

□ 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 

PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner. The applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the 
applicant with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the 
applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 

International application No. Unassigned 

PCT/US98/04493 

REC'D 03 APR 1998 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 116, line N/A 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



El 



Name of depositary institution American Type Culture Collection 

Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 

Date of deposit February 26, 1997 

Accession Number 97897 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 

In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 

For receiving Office use only 

This sheet was received with the international application 



Authorized officer 

Susan White 

PCT International Division 

For International Bureau use only 

□ 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 



PCT US 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner. The applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the 
applicant with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the 
applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 

International application No. Unassigned 

PCT/US98/04493 

REC'D 03 APR 1998 

WIPO PCT 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 119, line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection 

Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit September 4, 1997 

Accession Number 209235 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 

For receiving Office use only 

This sheet was received with the international application 

Authorized officer 

PCT International Division 

For International Bureau use only 

□ 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 

PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner. The applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the 
applicant with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the 
applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 

International application No. Unassigned 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 122, line N/A 

B. IDENTIFICATION OF DEPOSIT 

Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209044 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 

In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only 

This sheet was received with the international application 

Authorized officer Susan White 

PCT International Division 

For International Bureau use only 

□ 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 

PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner. The applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the 
applicant with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the 
applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 

International application No. Unassigned 

PCT/US98/04493 

REC'D 03 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 122, line N/A 

B. IDENTIFICATION OF DEPOSIT 

Further deposits are identified on an additional sheet 

Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 

Date of deposit February 26, 1997 



Accession Number 97898 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 

In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only 

This sheet was received with the international application 

Authorized officer 

Susan White 

PCT International Division 

For International Bureau use only 

□ 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 

PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner. The applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the 
applicant with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the
applicant in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally
decided upon by the Swedish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands
Industrial Property Office before the date on which the application is made
available to the public under Section 22C or Section 25 of the Patents Act of
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file 
reference number



PS002PCT 



International application No. Unassigned



REC'D 03 APR 1998



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 126, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ☒



Name of depositary institution American Type Culture Collection



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America



Date of deposit May 15, 1997 



Accession Number 209045 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer Susan White
PCT



□ 



For International Bureau use only
This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



PCT/US98/04493



CANADA 

The applicant requests that until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application.

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the
applicant in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally
decided upon by the Swedish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT



International application No. Unassigned





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 126 , line N/A 



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ☒



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America



Date of deposit February 26, 1997



Accession Number 97899 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28(4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 



Susan White

PCT International Division 



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



PCT/US98/04493



CANADA 

The applicant requests that, until either a Canadian patent has been issued
on the basis of an application or the application has been refused, or is
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the
applicant in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file
reference number 



PS002PCT 



International application No. Unassigned




WIPO PCT

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM



(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 130 , line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet ☒



Name of depositary institution American Type Culture Collection



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America 



Date of deposit April 28, 1997 


Accession Number 209011 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer



Susan White



For International Bureau use only
□ This sheet was received by the International Bureau on:



Authorized officer



Form PCT/RO/134 (July 1992)



CANADA 

The applicant requests that, until either a Canadian patent has been issued
on the basis of an application or the application has been refused, or is
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the
applicant in the individual case.

SWEDEN 

The applicant hereby requests that until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file 
reference number 



PS002PCT 



International application No. Unassigned




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 131, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ☒



Name of depositary institution American Type Culture Collection



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997


Accession Number 209046 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer



Form PCT/RO/134 (July 1992)



PCT/US98/04493



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the
applicant in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



International application No. Unassigned





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 131, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ☒



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America 



Date of deposit February 26, 1997



Accession Number 97900 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



Susan White 

PCT International Division 



For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to 
public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the applicant 
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file 
reference number 



PS002PCT 



International application No. Unassigned 



REC'D 03 APR 1998 



WIPO 



PCT 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 137, line N/A 



Further deposits are identified on an additional sheet 



☒ 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 




In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit") 



For receiving Office use only 






This sheet was received with the international application 



Authorized officer 



PCT International Division 



For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



PCT/US 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to 
public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the applicant 
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file 
reference number 



PS002PCT 



International application No. Unassigned 



REC'D 03 APR 1998 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 137, line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet ☒ 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 


Accession Number 209047 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer Susan White 

PCT International Division 



For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to 
public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the applicant 
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT 



International application No. Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 137, line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet ☒ 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997 


Accession Number 97901 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



Susan White 

PCT International Division 



For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to 
public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the applicant 
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z 
of Volume I of the PCT Applicant's Guide). If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a 
sample shall indicate the expert to be used. That expert may be any person 
entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



International application No. Unassigned 



REC'D 03 APR 1998 



WIPO PCT 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 140, line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit August 21, 1997 

Accession Number 209215 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28 (4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer Susan White 



PCT International Division 



For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



PCT/US98/04493 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to 
public inspection (by the Norwegian Patent Office), or has been finally 
decided upon by the Norwegian Patent Office without having been laid 
open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Norwegian Patent Office not later than at the time when the 
application is made available to the public under Sections 22 and 33(3) of the 
Norwegian Patents Act. If such a request has been filed by the applicant, any 
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a 
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to 
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the Danish Patent Office not later than at the time when the application 
is made available to the public under Sections 22 and 33(3) of the Danish 
Patents Act. If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Danish Patent Office or any person approved by the applicant 
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in the 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file
reference number



PS002PCT 



International application No. Unassigned 



REC'D 03 APR 



WIPO 



PCT 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 142, line N/A



Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit June 12, 1997 



Accession Number 209119



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet



□ 



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



This sheet was received with the international application



Authorized officer 



Susan White

PCT International Division



Form PCT/RO/134 (July 1992)



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



per JS 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Regulations), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the
furnishing of a sample shall only be effected to an expert in the art. 
UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that until the application has been laid open 
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file
reference number



PS002PCT 



International application No. Unassigned



WIPO 



PCT



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 142, line N/A



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit February 12, 1998



Accession Number 209627 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



0 



This sheet was received with the international application



Authorized officer 



Susan White 

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



PCI us 9S.^OM-M-g3 



CANADA 

The applicant requests that until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Regulations), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art.
UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file
reference number 



PS002PCT 



International application No. Unassigned




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)




A. The indications made below relate to the microorganism referred to in the description 
on page 146, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ^ 



Name of depositary institution American Type Culture Collection



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 


Accession Number 209048 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



This sheet was received with the international application



Authorized officer



PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992)



PC7 US 9a/oHm-g3 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the 
Commissioner of Patents only authorizes the furnishing of a sample of the 
deposited biological material referred to in the application to an 
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Regulations), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 
UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1) 
of the Patent Rules only by the issue of a sample to an expert. The request to 
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT



International application No. Unassigned



REC'D 03 APR



WIPO PCT 
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 146, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet



IS) 



Name of depositary institution American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America 



Date of deposit February 26, 1997



Accession Number 97902 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only . 



This sheet was received with the international application



Authorized officer



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer



Form PCT/RO/134 (July 1992)



PC" us Q8/OHm-93 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by 
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Regulations), or 
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art. 
UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally 
decided upon by the Danish Patent office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant 
with the International Bureau before the expiration of 16 months from the 
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in the 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands
Industrial Property Office before the date on which the application is made
available to the public under Section 22C or Section 25 of the Patents Act of
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT



International application No. Unassigned



REC'D 03 APR 1998



WIPO PCT



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet ^ 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997



Accession Number 209049 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought a sample of the deposited microorganism will be 
made available until the publication of the mention of the grant of the European patent or until the date on which 
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28 (4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only , 



This sheet was received with the international application



Authorized officer 



Susan White 

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992)



PCT US g8/0M-H93 



CANADA 

The applicant requests that until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person 
approved by tihe applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art.

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in Rule 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT



International application No. Unassigned



REC'D 03 APR 1998

WIPO    PCT



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997


Accession Number 97903 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought, a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28(4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer 



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992) 



CANADA 

The applicant requests that, until either a Canadian patent has been issued 
on the basis of an application or the application has been refused, or is 
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and
Registration without having been laid open to public inspection, the
furnishing of a sample shall only be effected to an expert in the art.




UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally
decided upon by the Swedish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in Rule 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT    International application No. Unassigned



REC'D 03 APR 1998



PCT 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 154, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit July 3, 1997


Accession Number 209139


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought, a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28(4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer

Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



CANADA 

The applicant requests that, until either a Canadian patent has been issued
on the basis of an application or the application has been refused, or is
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before 
completion of technical preparations for publication of the international 
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior 
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of
the Australian Patents Regulations).

FINLAND 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art.

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a 
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau 
before the completion of the technical preparations for the international 
publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the Swedish Patent Office), or has been finally 
decided upon by the Swedish Patent Office without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands 
patent or until the date on which the application is refused or withdrawn or 
lapsed, the microorganism shall be made available as provided in Rule 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made 
available to the public under Section 22C or Section 25 of the Patents Act of 
the Kingdom of the Netherlands, whichever of the two dates occurs earlier. 



Applicant's or agent's file reference number PS002PCT    International application No. Unassigned



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)




A. The indications made below relate to the microorganism referred to in the description 
on page 160, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America 



Date of deposit May 15, 1997



Accession Number 209050 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



In respect to those designations in which a European Patent is sought, a sample of the deposited microorganism will be
made available until the publication of the mention of the grant of the European patent or until the date on which
application has been refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert
nominated by the person requesting the sample (Rule 28(4) EPC).



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992)






CANADA 

The applicant requests that, until either a Canadian patent has been issued
on the basis of an application or the application has been refused, or is
abandoned and no longer subject to reinstatement, or is withdrawn, the
Commissioner of Patents only authorizes the furnishing of a sample of the
deposited biological material referred to in the application to an
independent expert nominated by the Commissioner, the applicant must, by
a written statement, inform the International Bureau accordingly before
completion of technical preparations for publication of the international
application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to
public inspection (by the Norwegian Patent Office), or has been finally
decided upon by the Norwegian Patent Office without having been laid
open to public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Norwegian Patent Office not later than at the time when the
application is made available to the public under Sections 22 and 33(3) of the
Norwegian Patents Act. If such a request has been filed by the applicant, any
request made by a third party for the furnishing of a sample shall indicate
the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person
approved by the applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a 
microorganism shall only be effected prior to the grant of a patent, or prior
to the lapsing, refusal or withdrawal of the application, to a person who is a
skilled addressee without an interest in the invention (Regulation 3.25(3) of 
the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open 
to public inspection (by the National Board of Patents and Registration), or
has been finally decided upon by the National Board of Patents and 
Registration without having been laid open to public inspection, the 
furnishing of a sample shall only be effected to an expert in the art.

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a
microorganism shall only be made available to an expert. The request to
this effect must be filed by the applicant with the International Bureau
before the completion of the technical preparations for the international
publication of the application.

DENMARK 



The applicant hereby requests that, until the application has been laid open
to public inspection (by the Danish Patent Office), or has been finally
decided upon by the Danish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the Danish Patent Office not later than at the time when the application
is made available to the public under Sections 22 and 33(3) of the Danish
Patents Act. If such a request has been filed by the applicant, any request
made by a third party for the furnishing of a sample shall indicate the expert
to be used. That expert may be any person entered on a list of recognized
experts drawn up by the Danish Patent Office or any person approved by the applicant
in the individual case.



SWEDEN 



The applicant hereby requests that, until the application has been laid open
to public inspection (by the Swedish Patent Office), or has been finally
decided upon by the Swedish Patent Office without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an
expert in the art. The request to this effect shall be filed by the applicant
with the International Bureau before the expiration of 16 months from the
priority date (preferably on the Form PCT/RO/134 reproduced in annex Z
of Volume I of the PCT Applicant's Guide). If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a
sample shall indicate the expert to be used. That expert may be any person
entered on a list of recognized experts drawn up by the Swedish Patent
Office or any person approved by the applicant in the individual case.

NETHERLANDS 



The applicant hereby requests that until the date of a grant of a Netherlands
patent or until the date on which the application is refused or withdrawn or
lapsed, the microorganism shall be made available as provided in Rule 31F(1)
of the Patent Rules only by the issue of a sample to an expert. The request to
this effect must be furnished by the applicant with the Netherlands
Industrial Property Office before the date on which the application is made
available to the public under Section 22C or Section 25 of the Patents Act of
the Kingdom of the Netherlands, whichever of the two dates occurs earlier.



